(12) INTERNATIONAL APPLICATION PUBUSHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
18 January 2001 (18.01«200]) 




PCT 



iiiiiiiiiiiiiiiiiuiiiiiinii 

(10) International Publication Number 

WO 01/04270 Al 



(51) International Patent Classification^: C12N 5/10, 
15/12, 15/63, 15/64, C07K 14/47, 14^05, 14/715, QOM 
33/53. 33/567 

(21) International Application Number: PCT/US0(V19094 



(22) International Filing Date: 
(25) Filing Language: 
(2d) Publication Language: 



13 July 2000 (13.07^000) 
English 
English 



(30) PrioHtyData: 
60/143.435 



13 July 1999 (13.07.1999) US 



(71) Applicant ffor MtksiffUited States except US): GENAIS- 
SANCE PHARMACEUTICALS, INC. [US/US]; Five 
Science Pafk, New Have«, CT 065 1 1 (US). 

(72) Inventors; and 

(7^ Inventors/Applicants ffbr US onfy)i CHEW, Anne 
[US/US]; 837 Orange Street, Apanment 2, New Haven, 
CT06S1 1 (US). DENTON, iU Rex [US/US]; 129 Hunters 
Trail, Madison, CT 06443 (US). DUDA, Amy [USAJS]; 
93 Marvel Road, New Haven, CT 06515 (US). NAND- 
ABALAN, Krishnan [IN/US]; 22g Village Pond Road, 
Guilfoid, CT 06437 (US). STEPHENS, Joel, Claiborne 
lUSAJSJ; 46 Crabapple Lane, Guilford. CT 06437 (US). 
WINDEMUTH, Andreas, K. [US/US]; 91 Center Road, 
Woodbridge, CT 06524 (US). 



(74) Agent: HENDERSON, Melodic, >V.; Genaissance Phar- 
maceuticals, Inc., Five Science Park, New Haven, CT 
06511 (US). 

(81) Designated States (national)*- AE, AG, AL, AM, AT. AU, 
AZ, BA, BB, BG. BR. BY, BZ, CA. CH, CN, CR, CU, CZ. 
DE, DK, DM, DZ, EE, ES. FI. GB. GD, GE, GH. GM, HR. 
HU. ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ. LC, LK, LR, 
LS, LT, LU. LV. MA, MD, MG, MK, MN, MW, MX, MZ. 
NO, NZ, PL, PT, RO. RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US. UZ. VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patem (AT, BE, CH, CY, DE. DK. ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, 
CJ. CM, GA, GN. GW, ML, MR, NE, SN, TD, TG). 

Published: 

— With international search report. 

— Before the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments. 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations ** appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 



(54) Title: DRUG TARGET ISOGENES: POLYMORPHISMS IN THE INTERLEUKIN 4 RECEPTOR ALPHA GENE 

O (57) Abstract: Polynucleotides comprising one or more of 38 novel single nucleotide polymorphisms in the human Inierleukin 4 
^ Receptor Alpha (IL4Ra) gene are described. Compositions and methods for detecting one or more of these polymorphisms are also 
^ disclosed. In addition, various genotypes and haplotypes for lL4Ra gene that exist in the popidadon are described. 



wo 01/04270 



PCT/USOO/19094 



DRUG TARGET ISOGENES: 
POLYMORPfflSMS IN THE INTERLEUKIN 4 RECEPTOR ALPHA GENE 
RELATED APPLICATIONS 

This application is a continuation-in-part of, and claims priority to, U.S. Provisional Application 
5 Serial No. 60/143,435 filed July 13. 1999. 

FIELD OF THE INVENTION 

This invention relates to variation in genes that encode pharmaceuticaU^ In 
particular, tiiis invention provides genetic variants of the human Inteileulcin 4 Receptor Alpha (IL4Ra) 
1 0 gene and methods for identifying which variant(s) of this gene is/are possessed by an in^vidual. 

BACKGROUND OF THE INVENTION 

Curr^t methods for identifying pharmaceuticals to treat disease often start by identifying, 
cloning, and expressing an important target protein related to the disease. A determination of whether an 

1 5 agonist or antagonist is needed to produce an effect that may benefit a patient with the disease is then 
made. Then, vast numbers of compounds are screened against the target protein to find new potential 
drugs. The desired outcome of this process is a drug that is specific for the target, thereby reducing the 
incidence of the undesired side effects usually caused by a compound's activity at non-intended targets. 
What this approach &ils to consider, however, is that natural variability exists in any and every 

20 population with respect to a particular protein. A target protein currently used to screen drugs typically is 
expressed by a gene cloned fix>m an individual who was arbitrarily selected. However, the nucleotide 
sequence of a particular gene may vary tremendously among individuals. Subtle alteration(s) in the 
primary nucleotide sequence of a gene encoding a target protein may be manifested as significant, 
variation in expression of or in the structure and/or fimction of the protein. Such alterations may explain 

25 the relatively high degree of uncertainty inherent in treatment of individuals with drugs whose design is 
based upon a single representative example of the target. For example, it is well-estabhshed that some 
classes of drugs firequentiy have lower efficacy in some individuals than others, which means such 
individuals and their physicians must weigh the possible benefit of a larger dosage against a greater risk 
of side effects. In addition, variable information on the biological fimction or efifects of a particular 

30 protein may be due to different scientists unknowingly studying di£ferent isofomis of the gene encoding 
the protein. Thus, information on the type and fiequency of genomic variation that exists for 
pharmaceutically important proteins would be useful 

The organization of single nucleotide variations (polymorphisms) in the primary sequence of a 
gene into one of the limited number of combinations that exist as units of inheritance is termed a 

35 haplotype. Each haplotype therefore contains significantiy more iriformation than individual unorganized 
polymorphisms. Haplotypes provide an accurate measurement of the genomic variation in the two 
chromosomes of an individual. 
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It is well-established that many diseases are associated with specific variations in gene sequences. 
However while there are examples in which individual polymorpihisms act as gen^c markers for a 
particular phenotype, in other cases an individual polymorphism may be found in a variety of genomic 
backgrounds and therefore shows no definitive coupling between the polymorphism and the causative site 
5 for the phenotype (Clark AG et al 1998 Am J Hum Genet 63:595-612; Ulbrecht M et al. 2000 Am J 

RespirCrit Care Med 161: 469-74). In addition, the marker may be predictive in some populations, but 
not in other populations (Claik AG et al 1998 supra). In these instances, a haplotype will provide a 
superior genetic marker for the phenotype (Clark AG et al. 1998 supra; Ulbrecht M et al. 2000, supra; 
Ruaiio G & Stephens JC Gen EngNews 19 (21), December 1999). 

10 Analysis of the association between each observed haplotype and a particular phenotype permits 

ranking of each haplotype by its statistical power of prediction for the phenotype. Haplotypes foimd to be 
strongly associated with the pheaiotype can then have that positive association confirmed by alternative 
methods to minimize false positives. For a gene suspected to be associated with a particular phenotype, if 
no observed haplotypes for that gene show association with the phenotype of interest, then it may be 

15 inferred that variation in the gene has little, if any, involvement with that phenotype (Ruafio & Stephens 
1999, supra). Thus, information on the observed haplotypes and their fi*equency of occurrence in various 
population groups will be useful in a variety of research and clinical applications. 

One possible drug target for the treatment of allergies, asthma, and other immune responses is the 
Interleukin 4 Receptor Alpha (EL4Ra) gene or its encoded product. IL4R is a transmembrane complex , 

20 composed of two different protein subunits, a 140-kDa high affinity binding subunit named interleukin-4 
receptor a (IL-4Ra; also known as CD 124 antigen) and either a gamma-c subunit, which is present in 
several cytokine receptors, or an interl6ukin-13 receptor 1 (IL-13R1) subunit. Both subunits of the IL-4R 
are required to bind interleukin'4 (IL^) and to mediate its transcription-activating effects through the 
tyrosine kinases, Jakl and Jak3. Upon bmding of 11^4 to the II^4R, Jak 1 and Jak 3 phosphorylate the IL- 

25 4Ra subunit, creating binding sites in the cytoplasmic domain for many other proteins, including SOS, 
Stat-6, c-fes, and src homology phosphatase 1 (SHP-l). Activated Jak proteins also phosphorylate Stat- 
proteins, which travel into the nucleus and function as transcription factors. Other IL-4 signal pathways 
exist, but are less well characterized. 

IL-4, one of the most important cj^okines involved in the allergic response, is produced when 

30 cells from the immune system, in particular T cells, are activated in response to an allergen. Regulation 
of the immune response involves Helper T-cells that differentiate into two subtypes, Thl and Th2. Thl 
cells express interferon-gamma and interluekin-2 (IL-2), and mediate a cell-based immunity, where 
. macrophages and neutrophils are prominently involved. Thl cells also direct the IgB-producing B cells, 
as well as mast cells, basophils and eosinophils. Th2 cells produce IL-4, 11^5, IL-6, IL-10 and IL-13. 

35 . Each Th cell subtype represses the other, so the immune system is forced into differentiation into either a 
Thl or Tli2 response against an external allergic challenge. In many instances an aberration of this 
response can render a pathological state such as a Th2 response against ragweed. 
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IL-4 induces in B cells the synthesis of IgE type antibodies that recognize specific allergens. IgE 
binds to receptors on mast cells and basophils and mediates the early humoral (sub-chronic) response on 
the B-side of the immune system. If an allergen binds mast cell-attached IgE, the mast cell releases 
mediators like histamine, and the eicosanoid leukotrienes and prostaglandins products, some of which 
5 cause the familiar symptoms of an acute allergic leaction: swelling, itching, mucous, and reddening of the 
skin. Later in this process eosinophils and other inflammatory cells migrate to the site of inflammation. 
This later phase is important in asthma, because the eosinophils may instigate a more chronic 
inflammation which can adversely scar lung tissue. IL-4 is at least partly responsible for recruiting 
eosinophils, because it induces synthesis of specific adhesion molecules on the capillary endothelium, and 
10 stimulates expression of IL-S and eotaxin. IL-5 leads to the development of a large number of 

eosinophils finom precursor cells in the bone marrow, and eotaxin'stimulates their migration into the lung 
tissue. 

It has been proposed that inhibition ofJLr4 activity would disarm the Th2 compoiient of the 
immune system. This would then allow the inomune system to develop a natural tolerance towards 

15 common allergens without the full acute response to the challenge. In this way, tolerance may be induced 
in many patients, similar to what is sometimes achieved with hyposensitization shots for allergy patients. 
Thus, substances that inhibit IL-4 production and/or its binding to the IL-4 receptor, may improve the 
therapy of allergies and asthma. 

The gene for IL-4Ra has eleven exons encoding an 825 amino acid protein and spans over 24 kb 

20 of the short arm of chromosome 16 (16pl2.1) (Pritchard et aL, Genomics 10:801, 1991; GenBank 

Accession No. AC004525). A reference sequence for II^R-kx gene, which corresponds to the reverse 
complement of nucleotides 100020-71331 ui the GenBank Accession No. AC004525, is shown in Fig. 1 
(S£Q ID NO: 1). Reference sequences for IL-4Ra mRNA (GenBank Accession No. NM_00041 8) and 
the cmcoded IL4Ra precursor protein (GenBank Accession No. P24394) are ishown in Figs. 2 and 3, 

25 respectively (SEQ ID N0S:2 and 3). Significant features r^orted for the IL-4Ra precursor include: a 
signal pq>tide located between a.a. 1 and 25; an extracellular domam between a.a. 26 and 232; disulfide 
bonds between a.a. 34 and 44 and between a.a. 74 and 86; glycosylation sites at amino acids 53, 98, 128, 
134, 1 76, and 209; a transmembrane region between a.a 233 and 256; and a cytoplasmic domain between 
a.a. 257 and 825. 

30 Recently, several studies have suggested that genetic polymorphisms in the IL4-Ra gene are 

associated with genetic predisposition to atopy and/or elevated senun IgE. Mitsujrasu et al., reported that 
polymorphisms at codons 75 and 576 affect IL^R fimction (Nat Genet 19:1 19-120, 1998). The IL-4Ra 
allele with isoleucine at amino acid position 75 (Ile75) in the extracellular domain is more responsive to 
IL-4 than the allele with valine at that position (Val75) and is associated with atopic asthma but not with 

35 non-atopic asthma (Mitsuyasu et al., supra). Also, the allele with arginine at position 576 (Arg576) in the 
cytoplasmic domain exhibits higher receptor activity than the glutamine allele (Glu576) due to reduced 
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binding by the Arg576 allele of a negative regulatory molecule, ^ homology pho^hatase 1 (Imani et al., 
J\ Biol Chem. 272:7927-793 1, 1997). The Arg576 allele has a higher finequency in patients with allergic 
inflammatory disorders, including atopy (Khurana Hershey et al.. New Eng. J, Med 337: 1 720- 1 725, 
1997). In a recent study, Hersh^ et al.; (WO 00/34789) reported that the Arg576 aUele is significantly 
5 associated with asthma. Studies showed that patients who were homozygous for this allele had about a 9- 
. fold higher risk towards asthma and that two copies of Arg576 are associated in an increase in asthma 
prevalence and severity. Kanemitsu et al. recently reported that the presence of either the Ee75 or Arg576 
variant is significantly associated with susceptibility for developing systemic lupus erythematosus (SLE), 
a Th2-dominant systemic autoimmune disorder (Arthritis Rheum. 42: 1298-1300), Another variant IL- 

10 4Ra allele that is reportedly associated with atopy susceptibility has proline rather than serine at position 
503 (Kruse et al., Immunol. 96:365-371, 1999). Other ILr4Ra gene polymorphisms leading to amino acid 
changes in the cytoplasmic domain of the protein product have been id^ified at codons 400 (E400A), 
431 (C431R) and 786 (S786P) (Kruse et al., supra; Deichtnann et al., Biochem. Biophys, Res. Commun. 
231 :696-697, 1997). A polymorphism of guanine or ad^iine at a position corresponding to nucleotide 

1 5 55328 in Figure 1 has also been reported as well as a polymorphism of cytosine or thymine at a position 
corresponding to nucleotide 55430 (Buetow et al., 1999, Nat Genet. 21 :323-5). 

Because of the potential for polymorphisms in the IL4Ra gene to affect the expression and 
function of the encoded protein, it would be useful to determine whether additional polymorphisms exist 
in the IL4Ra gene, as well as how such pol3miorphism$ are combined in different copies of the gene. 

20 Such information would be useful for studying the biological function of IL4Ra as well as in identifying 
drugs targeting this protein for the treatment of disorders related to its abnormal expression or function. 

SUMMARY OF THE INVENTION 

Accordingly, the inventors herein have discovered 38 novel polymorphic sites in the IL4Ra gene. 

25 These polymorphic sites (PS) correspond to the following nucleotide positions in the reverse compl^ent 
of the indicated GenBank Accession Number 32884 (PSl). 32903 (PS2), 329(51 (PS3), 33 135 ^S4), 
35763 (PS6), 35770 (PS7), 35817 (PS8), 35905 (PS9), 35944 (PSIO), 35958 (PSl 1), 37330 (PS12), 
37473 (PS13), 37586 (PS14), 37591 (PS15), 37604 (PS16), 37644 (PS17). 37678 (PS18), 43446 (PS19), 
43703 (PS20), 53008 (PS21), 53099 (PS22), 53153 (PS23), 53456 (PS25), 53507 (PS27), 53513 (PS28), 

30 53915 (PS30), 53949 (PS32), 54237 (PS33), 54468 (PS34), 5461 1 (PS35), 54698 (PS36), 54700 (PS37), 
54741 (PS38), 54780 (PS39), 55083 (PS40), 55142 (PS41), 55539 (PS44) and 55758 (PS45) in 
AC004525. The polymorphisms at these sites are adenine or guanine at PSl , cytosine or thymine at PS2, 
guanine or thyniine at PS3, guanine or cytosine at PS4, cytosine or thymine at PS6, guanine or adenine at 
PS7, thymine or cytosine at PS8, cytosine or thymine at PS9, cytosme or thymine at PSIO, guanine or 

'35 adenine at PSl 1, guanine or adenine at PS12, cytosine or thymine at PS13, cytosme or thymine at PS14, 
guanine or adenine at PS15, adenine or thymine at PS16, cytosine or adenme at PS17, cytosine or 
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thymine at PS18, guanine or adenine at PS19, thymine or cytosine at PS20, adenine or cytosine at PS21, 
cytosine or thymine at PS22, thymine or cytosin at PS23, guanine or thymine at PS25» cytosine or 
thymine at PS27, thymine or cytosine at PS28» cytosine or thymine at PS30, guanine or adenine at PS32, 
cytosine or thymine at PS33, thymine or guanine at PS34, thymine or cytosine at PS35, thymine or 
5 cytosine at PS36, thymine or cytosine at PS37, cytosine or thymine at PS38, cytosine or guanine at PS39, 
adenine or guanine at PS40, guanine or adenine at PS41, cytosine or thymine at PS44 and guanine or 
adenine at PS45. In addition, the inventors have detennined the identity of the alternative nucleotides 
present at these sites, as well as at the previously identified sites at nucleotides 35749 (PS5), 5341 3 
(PS24), 53505 (PS26), 53721 (PS29), 53941 (PS31), 55328 (PS42), and 55430 (PS43). It is beUeved that 

1 0 IL4Ra-encoding polynucleotides containing one or more of the novel polymorphic sites reported herein 
will be useful in stud3dng the expression and biological function of IL4Ra, as well as in developing drugs 
targeting this protein. In addition, information on the combinations of polymoiphisms in the IL4Ra gene 
may have diagnostic and forensic implications. 

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a 

15 nucleotide sequence which is a polymorphic variant of a reference sequence for the IL4Ra gene or a 
•ixagment thereof. The reference sequence comprises SEQ ID NO: 1 and the polymorphic variant 
comprises at least one polymorphism selected fiom the group consisting of guanine at PSl, thymine at 
PS2, thymine at PS3, cytosine at PS4, thymine at PS6, adenine at PS7, cytosine at PS8, thyniine at PS9, 
thymine at PSIO. adenine at PSl 1, adenine at PS12, thymine at PS13, thymine at PS14, adenine at PS15, 

20 thymine at PS 1 6, adenine at PS 1 7, thymine at PS 1 8, adenine at PS 1 9, cytosine at PS20, cytosine at PS2 1 , 
thymine at PS22, cytosine at PS23, thymine at PS25, thymine at PS27, cytosine at PS28, thymine at PS30, 
adenine at PS32, thymine at PS33, guanine at PS34, cytosine at PS35, cytosine at PS36, cytosine at PS37, 
thymine at PS38, guanine at PS39, guanine at PS40, adenine at PS41, thymine at PS44, and adenine at 
PS45. In a preferred embodiment, the polymorphic variant comprises one or more additional 

25 polymorphisms selected fix>m the group consisting of guanine at PS5, cytosine at PS24, cytosine at PS26, 
cytosine at PS29, guanine at PS3 1 , adenine at PS42, and thymine at PS43. A particularly preferred 
polymorphic variant is a naturally-occurring isofbim (also referred to herein as an ^'isogene*^) of the 
IL4Ra gene. An IL4Ra isogene of the invention comprises adenine or guanine at PSl, cytosine or 
thymine at PS2, guanine or thymine at PS3, guanine or cytosine at PS4, cytosine or thymine at PS6, 

30 guanine or adenine at PS7, thymine or cytosine at PS8, cytosine or thymine at PS9, cytosiue or thymine at 
PSIO. guanine or adenine at PSl 1, guanine or adenine at PS12, cytosine or thymine at PS13, cytosine or 
thymine at PS 1 4, guanine or adenine at PS 1 5 , adenine or thymine at PS 1 6, cytosine or adenine at PS 1 7, 
c3rtosine or thymine at PS 18, guanine or adenine at PS 19, thymine or cytosine at PS20, adenine or 
cytosine at PS2 1 , cytosine or thymine at PS22, thymine or cytosine at PS23, guanine or thymine at PS25, 

35 cytosine or thymine at PS27, thymine or cytosine at PS28, cytosine or thymine at PS30, guanine or ■ 

adenine at PS32, cytosine or thymine at PS33, thymine or guanine at PS34, thymine or cytosine at PS35, 
thymine or cytosine at PS36, thymine or cytosme at PS37, cytosine or thymine at PS38, cvtosine or 
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guanine at PS39, adenine or guanine at PS40» guanine or adenine at PS41, cytosine or thymine at PS44 
and guanine or adenine at PS45. The invention also provides a collection of IL4Ra isogenes, referred to 
herein as an IL4Ra genome anthology. 

An IL4Ra isogene may he defined hy the combination and order of these polymoiphisms in the 
isog^e, which is referred to herein as an IL4Ra haplotype. Thu$» the invention also provides data on the 
number of different IL4Ra haplotypes found in the above four population groups. This haplotype data is 
useful in methods for deriving an IL4Ra haplotype from an individual's genotype for the IL4Ra gene and 
for determining an association between an IL4Ra haplotype and a particular trait. 

In another embodiment, the invention provides a poly^ucleot^de comprising a polymorphic 
variant of a reference sequence for an IL4Ra cDNA or a ftagment thereof. The reference sequence 
comprises SEQ ID NO:2 (Fig, 2) and the polymorphic cDNA comprises at least one polymorphism 
selected from the group consisting of thymine at a position corresponding to nucleotide 237, adenine at a 
position corresponding to nucleotide 244, cytosine at a position corresponding to nucleotide 291, thymine 
at a position corresponding to nucleotide 501, adenine at a position corresponding to nucleotide 554, . 
cytosine at a position corresponding to nucleotide 939, thymine at a position corresponding to nucleotide 
1242, thymine at a position corresponding to nucleotide 1293, cytosine at a position corresponding to 
nucleotide 1299, thymine at a position corresponding to nucleotide 1701, adenine at a position 
corresponding to nucleotide 1 735, thymine at a position corresponding to nucleotide 2023, guanine at a 
position corre^nding to nucleotide 2254 and cytosine at a position corresponding to nucleotide 2397 . 
In a preferred embodiment, the polymorphic variant comprises one or more additional polymorphisms - 
selected from the group consisting of guariine at a position corresponding to 223, cytosine at a position 
corresponding to nucleotide 1 199, cytosine at a positiori corresponding to 1291, cytosine at a position 
corresponding to nucleotide 1507 and guanine at a position corresponding to 1737. 

Polynucleotides complementary to these IL4Ra genomic and cDNA variants are also provided by 
the invention. 

In other embodiments, the invention provides a recombinant expression vector comprising one of 
the polymorphic genomic variants operably linked to eicpre'ssion regulatory elements as well as a 
recombinant host cell transformed or transfected with the expression vector. The recombinant vector and 
host cell may be used to express IL4Ra for protein structure analysis and drug binding studies. 

In yet ailother embodiment, the invention provides a polypeptide comprising a polymorphic 
variant of a reference amino acid sequence for the IL4Ra protein. The refi^race amino add sequence 
comprises SEQ ID NO;3 (Fig, 3) and the polymorphic variant comprises at least one variant amino acid . 
selected from the group consisting of threonine at a position corresponding to amino acid 82, histidine at 
a position corresponding to amino acid 185, isoleucine at a position corresponding to amino acid 579, 
serine at a position corresponding to amino acid 675, and alanine at a position corresponding to amino 
acid 752. In some embodiments, the polymorphic variant also comprises at least one variant amino acid 
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selected from the group consisting of valine at a position coiresponding to amino acid 75, alanine at a 
position corresponding to amiifo acid 400» arginine at a position correpsonding to amino acid 431 , proline 
at a position corresponding to amino acid 503, and arginine at a position corresponding to amino acid 
576. A polymorphic variant of IL4Ra is useful in studying the effect of the variation on the biological 
5 activity of IL4Ra as well as studying the binding affinity of candidate drugs targeting IL4Ra for the 
treatment of allergies, asthma, and other immune responses. 

The present invention also provides antibodies that recognize and bind to the above polymorphic 
lL4Ra protein variant. Such antibodies can be utilized in a variety of diagnostic and prognostic formats 
and therapeutic methods. 

10 In other embodiments, the invention provides methods, compositions, and kits for haplotyping 

and/or genotyping the IL4Ra gene in an individual. The methods involve identifying the nucleotide or 
nucleotide pair present at one or more polymorphic sites selected from PSl , PS2, PS3, PS4, PS6, PS7, 
PS8, PS9, PSIO, PSl 1, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, 
PS25, PS27, PS28, PS30, PS32, PS33, PS34, PS35. PS36, PS37, PS38. PS39, PS40. PS41, PS44, and 

15 PS45 in one or both copies of the IL4Ra gene from the individual. The compositions contain 

oligonucleotide probes and primers designed to specifically hybridize to one or more target regions 
containing, or that are adjacent to, a polymorphic site. The methods and compositions for establishing the 
' genotype or haplotype of an individual at the novel polymorphic sites described herein are useful for 
studying the effect of the polymorphisms in the etiology of diseases affected by the expression and 

20 function of the IL4Ra protein, studying the efficacy of drugs targeting IL4Ra, predicting individual 
susceptibility to diseases affected by the expression and function of the IL4Ra protein and predicting 
individual responsiveness to drugs targeting IL4Ra. 

In yet another embodiment, the invention provides a method for identifying an association 
between a genotype or haplotype and a tmit. In preferred embodiments, the trait is susceptibility to a 

25 disease, severity of a disease, the staging of a disease or response to a drug. Such methods have 

applicability in developing diagnostic tests and therapeutic treatments for allergies, asthma, and other 
inmiune responses. 

The present invention also provides transgenic animals comprising one.of the IL4Ra genomic 
polymorphic variants described herein and methods for producing such animals. The transgenic animals 
30 are useful for studying expression of the IL4Ra isogenes in vivo^ for in vivo screening and testing of 
drugs targeted against IL4Ra protein, and for testing the efficacy of therapeutic agents and compounds 
for allergies, asthma, and other inunune responses in a biological system. 

The present invention also provides a computer system for storing and displaying polymorphism 
data determined for the IL4Ra gene. The computer system comprises a computer processing unit; a 
35 display; and a database containing the polymorphism data. The polymorphism data includes the 
polymorphisms, the genotypes and the haplotypes identified for the IL4Ra gene in a reference 
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population. In a prefeired embodiment, the computer system is capable of producing a display showing 
IL4Ra haplotypes organized according to their evolutionary relationships. 

BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1 illustrates a reference sequence for the IL4Ra gene that is the reverse complement of 

part of Genbank Accession Number AC004525.1; contiguous lines; SEQ ID NO:l)» with the underlines 
indicating the start and stop codons, shading indicating the reference coding sequence, and the bold 
nucleotides indicating the polymorphic sites and polymorphisms identified by Applicants in a reference 
population. 

10 Figure 2 illustmtes a reference sequence for the IL4Ra coding sequence (GenBank Accession 

Number X52425; contiguous lines; SEQ ID NO:2), with the underlines indicating the start and stop 
codons, and the bold nucleotides indicating the polymorphic sites and polymorphisms identified by 
Applicants in a reference population. 

Figure 3 illustrates a reference sequence for the IL4Ra protein (GenBank Accession Number 

15 CAA36672; contiguous lines; SEQ ID N0:3), with the bold amino acids indicating the amino acid 
variations caused by the polymorphisms of Fig. 2. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention is based on the discovery of novel variants of the IL4Ra gene. As 

20 described in more detail below, the inventors herein discovered 38 novel polymorphic sites by 

characterizing the IL4Ra gene found in genomic DNAs isolated from Index Repository lA that contains 
immortalized cell lines from one chimpanzee and 93 human individuals and Index Repository IB that 
contains 70 human individuals. Theses two repositories contain 5 1 individuals in common. 

The human individuals in Index Repository lA included a reference population of 79 unrelated 

25 individuals self-identified as belonging to one of four major population groups: Caucasian (22 

individuals), African descent (20 individuals) Asian (20 individuals) Ifispanic/Latino (17 individuals). 
To the extent possible, the members of this reference population were organized into population 
subgroups by the self-identified ethnogeographic origin of their four grandparents as shown in Table 1 
below. In addition. Index Repository lA contains three unrelated indig^ous American Indians (one from 

30 each of North, Central, and South America), one three-generation Caucasian family (From the CEPH 
Utah cohort) and one two-generation African-American family. 
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Table 1. Population Groups in Index Repository lA 





Pomilation Subffroun 


Wo of Tndi vt duals 


•ciLiiivaii. ucova^ui* 




20 




Sierra T^ef>ii6 


1 




/Vsian * 




20 




Rurtna 


\ 




wiiliia 


1 




Japan 






JVVil WW 


1 

1 





'PK 1 1 itmi ti 


< 

J 




V JlCUuUJl 


A 
*t 


waUCoStlaJl 








JDlllloU laiCS 


'I 






A 






I 




Central/Eastern .. 


1 




Eastern 


3 




Central/Mediterranean 


1 




Mediterranean 


2 




Scandinavian 


2 


Hispanic/Latino 




17 




Caribbean 


7 




Caribbean (Spanish Descent) 


2 




Central American (Spanish Descent) 


1 




Mexican American 


4 




South American (Spanish Descent) 


3 



Index Repository IB contains a reference population of 70 human individuals comprised of 4 
three-generation families (ftom the CEPH Utah cohort) as well as umrelated African-American, Asian, 
and Caucasian individuals. A total of 38 individuals in this reference population are unrelated. 

Using the IL4Ra genotypes identified in the Index Repositories and the methodology described 
in the Examples below, the inventors herein also determined the haplotypes found on each chromosome 
for most human members of this repository. The IL4a genotypes and haplotypes found in the Index 
Repositories include those shown in Tables 4 and 5> respectively. The polymorphism and haplotype data 
disclosed herein are useful for studying population diversity, anthropological lineage, the significance of 
diversity and lineage at the phenotypic level, paternity testing, forensic applications, and for identifying 
associations between the IL4Ra graetic variation and a trait such as level of drug response or 
susceptibility to disease. 

In the context of this disclosure, the following t^ms shall be defined as follows unless otherwise 
indicated: 

Allele - A particular form of a genetic locus, distinguished from other forms by its particular 
nucleotide sequence. 

Candidate Gene - A gene which is hypothesized to be responsible for a disease, condition, or the 
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response to a treatment, or to be coirelated with one of thes^ 

Gene - A segment of DNA that contains all the information for the regulated biosynthesis of an 
RNA product, including promoters, exons, introns, and other untranslated regions that control expression. 

Genotype - An unphaised 5 ' to 3 ' sequence of nucleotide pair($) found at one or more 
5 polymorphic sites in a locus on a pair of homologous chromosomes in an individual. As used herein, 
genotype includes a full-genotype and/or a sub-genotype as described below. 

Full-genotype — The imphased 5 ' to 3 ' sequence of nucleotide pairs found at all known 
polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 

Sub-genotype - ITie unphased 5 ' to 3 ' sequence of nucleotides seen at a subset of the known 
1 0 polymorphic sites in a locus on a pair of homologous chromosomes in a single individual. 

Genotyping - A process for determining a genotype of an individual. 

Haplotype - A 5' to 3' sequence of nucleotides found at one or more polymorphic sites in a locus 
on a single chromosome from a single individual. As used herein, haplotype includes a jEull-h^lotype 
and/or a sub-haplotype as described below. 
15 Fnll-haplotype - The 5' to 3" sequence of nucleotides found at all known polymorphic sites in a 

locus on a single chromosome fi*om a single individual. 

Sub-haplotype - The 5 ' to 3 ' sequence of nucleotides seen at a subset of the known polymorphic 
sites in a locus on a single chromosome from a single individual. 

Haplotype pair ~ The two haplotypes found for a locus in a single individual. 
20 Haplotyping - A process for determining one or more haplotypes in an individual and includes 

use of femily pedigrees, molecular techniques and/or statistical inference. 

Haplotype data - Infonnation concerning one or more of the following for a specific gene: a 
listing of the haplotype pairs in each individual in a population; a listing of the different haplotypes in a 
population; frequency of each haplotype in that or other populations, and any known associations between 
25 one or more haplotypes and a trait. 

Isoform - A particular form of a gerie, mRNA, cDNA or the protein encoded thereby, 
distinguished from other forms by its particular sequence and/or structure. 

Isogene - One of the isoforms of a gene found in a population. An isogetie contains all of the 
polymorphisms present in the particular isoform of the gene. 
30 Isolated - As applied to a biological molecule such as RNA, DNA, oligonucleotide, or protein, 

isolated means the' molecule is substantially free of other biological molecules such as nucleic adds, 
proteins, lipids, carbohydrates, or other material such as cellular debris and growth media. Generally, the 
term "isolated" is not intended to refer to a complete absence of such material or to. absence of water, 
buffers, or salts, unless they are present in amotmts that substantially interfere with the methods of the 
35 present invention. 

Locus - A location on a chromosome or DNA molecule corresponding to a gene or a physical or 
phenolypic feature. 

10 
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NatnraUy- ccurring - A term used to designate that the object it is applied to, e.g., naturally- 
occurring polynucleotide or polypeptide, can be isolated from a source in nature and which has not been 
intentionally modified by msCn. 

Nucleotide pair - The nucleotides fourid at a polymorphic site on the two copies of a 
chnimosome from an individual. 

Phased — As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a 
locus, phased means the combination of nucleotides present at those polymorphic sites on a single copy of 
the locus is known. 

Polymorphic site (PS) - A position within a locus at which at least two alternative sequences axe 
found in a population, the most frequent of which has a frequency of no more than 99%. 

Polymorpiiic variant - A gene, mRNA, cI>NA, polypeptide or peptide whose nucleotide or 
amino acid sequence varies fix>m a reference sequmce due to the presotice of a polymorphism in the gene. 

Polymorpliisni - The sequence variation observed in an individual at a polymorphic site. 
Polymorphisms include nucleotide substitutions, insertions, deletions and microsatellites and may, but 
need not, result in detectable differences in gene e?q>ression or protein function. 

Polymorphism data - Information concerning one or more of the following for a specific gene: 
location of polymorphic sites; sequence variation at those sites; frequency of polymorphisms in one or 
more populations; the different genotypes and/or haplotypes determined for the gene; frequency of one or 
more of these genotypes and/or haplotypes in one or more populations; any known association(s) between 
a trait and a genotype or a haplotype for the gene. 

Folymorpliisiii Database - A collection of polymorphism data arranged in a systematic or . 
methodical way and capable of being individually accessed by electronic or other means. 

Polynucleotide - A nucleic acid molecule comprised of single-stranded RNA or DNA or 
comprised of complementary, double-stranded DNA. 

Population Group - A group of individuals sharing a common ethnogeographic origin. 

Reference Population - A group of subjects or individuals who are predicted to be rqiresentative 
of the genetic variation found in the general population. Typically, the reference population represents 
the genetic variation in the population at a certainty level of at least 85%, preferably at least 90%, more 
preferably at least 95% and even more preferably at least 99%. 

Single Nucleotide Polymorphism (SNP) - Typically, the specific pair of nucleotides obs^ved at 
a single polymorphic site. In rare cases, three or four nucleotides may be foimd. 

Subject - A human individual whose genotypes or haplotypes or response to treatment or disease 
state are to be determined. 

Treatment - A stimulus administered internally or externally to a subject. 

Unphased - As applied to a sequence of nucleotide pairs for two or more polymorphic sites in a 
locus, unphased means the combination of imcleotides present at those polymorphic sites on a singite copy 
of the locus is not known. 

11 



wo 01^04270 PCTAJSOO/19094 

The inventors hmm have discovered 38 novel polymorphic sites » and confirmed the existence of 
7 other sites, in th IL4Ragene. The polymorphic sites identified by the inventors are referred to as PS 1- 
45 to designate the order in which they are located in the gene (see Table 3 below)» with the novel 
polymorphic site referred to as PSl, PS2, PS3, PS4, PS6, PS7, PS8, PS9, PSIO, PSl 1. PS12, PS13, PS14, 
5 PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS25, PS27, PS28, PS30, PS32, PS33, PS34, 
PS35, PS36, PS37, PS38, PS39, PS40, PS41, PS44, and PS45 and the previously reported polymorphic 
sites referred to as PS5, PS24, PS26, PS29, PS3 1. PS42, and PS43. 

Thus, in one embodiment, the invention provides an isolated polynucleotide comprising a 
polymorphic variant of the IL4Ra gene or a fragment of the gene which contains at least one of the novel 

10 polymorphic sites described herein. The nucleotide sequence of a variant IL4Ra gene is identical to the 
reference genomic sequence for those portions of the gene examined, as described in the Examples below, 
except that it comprises a different nucleotide at one or more of the novel polymorphic sites PSl » PS2, 
PS3, PS4, PS6, PS7, PS8, PS9, PSIO, PS 11, PS 12. PS13, PS 14, PS15, PS16, PS17, PS18, PS19, PS20, 
PS21, PS22. PS23. PS25, PS27, PS28, PS30, PS32, PS33, PS34, PS35, PS36, PS37, PS38, PS39, PS40, 

15 PS41, PS44, and PS45,and may also comprise one or more additional polymorphisms selected from the 
group consisting of PS5, PS24, PS26, PS29, PS31, PS42, and PS43. Similarly, the nucleotide sequence 
of a variant fragment of the IL4Ra gene is identical to the corresponding portion of the reference 
sequence except for having a different nucleotide at one or more of the novel polymoxphic sites described 
herein. Thus, the invention specifically does not include polynucleotides comprising a nucleotide 

20 sequence identical to the reference sequence (or other reported lL4Ra sequences) or to portions of the 
reference sequence (or other reported IL4Ra sequences), except for genotyping oligonucleotides as 
described below; 

The location of a polymorphism in a variant gene or fragment is identified by aligning its 
sequence against SEQ ID NO: 1 . The polymorphism is selected from the group consisting of guanine at 

25 PSl, thymine at PS2, thymine at PS3, cytosine at PS4, thymine at PS6, adenine at PS7, cytosine at PS8, 
thymine at PS9, thymine at PSIO, adenine at PSl 1, adenine at PS12, thymine at PS13, thymine at PSI4, 
adenine at PS15, thymine at PS16, ad^iine at PS17, thymine at PS18, adenine at PS19, cytosine at PS20, 
cytosine at PS21, thymine at PS22, cytosine at PS23, thymine at PS25, thymine at PS27, cytosine at 
PS28, thymine at PS30, adenine at PS32, thymine at PS33, guanine at PS34, cytosine at PS35, cytosine at 

30 PS36, cytosine at PS37, thymine at PS38, guanine at PS39, guanine at PS40, adeniiie at PS41 , thymine at 
PS44, and adenine at PS45. In a preferred embodiment, the polymorphic variant comprises a naturally- 
occurring isogene of the IL4Ra gene which is defined by any one of haplotypes 1-53 shown in Table 5 
below. 

Polymorphic variants of the invention may be prepared by isolating a clone containing the IL4Ra 
35 gene from a himian genomic library. The clone may be sequenced to determine the identity of the 

nucleotides at the polymorphic sites described herein. Any particular variant claimed herein could be 
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prepared fiiom this clone by perfonning in vitro mutagenesis using procedures well-known in the art* 

IL4Ra isogenes may be isolated using any method that allows separation of the two '^copies'' of 
the IL4Ra gene present in an individual, which, as readily understood by the skilled artisan, may be the 
same allele or different alleles. Separation methods include targeted iit vivo cloning (TTVC) in yeast as 
5 described in WO 98/01573, U.S. Patent No. 5,866,404, and copending U.S. application Serial No. 

08/987,966. Another method, which is described in copending U.S. Application Serial No. 08/987,966, 
uses an allele specific oligonucleotide in combination with primer extension and exonuclease degradation 
to generate hemizygous DNA targets. Yet other methods are single molecule dilution (SMD) as 
described in Ruafio et al., Proc. Natl. Acad. Sci. 87:6296-6300, 1990; and allele specific PGR (Ruafio et 

10 al„ 17 Nucleic Acids. Res. 8392, 1989; Ruano et aL, 19 Nucleic Acids Res. 6877-6882, 1991; 
Michalatos-Beloin et al., 24 Nucleic Acids Res. 4841-4843, 1996). 

The invention also provides IL4Ra genome anthologies, which are collections of IL4Ra isogenes 
found in a given population. The population may be any group of at least two individuals, including but 
not limited to a reference population, a population group, a femily population* a clinical population, and a 

15 same sex population. An IL4Ra genome anthology may comprise individual IL4Ra isogenes stored in 
separate containers such as microtest tubes, sqparate wells of a microtitre plate and the like. 
Alternatively, two or more groups of the IL4Ra isogenes in the anthology may be stored in separate 
containers. Individual isogenes or groups of isogenes in a genome anthology may be stored in any 
convenient and stable form, including but not limited to in buffered solutions, as DNA precipitates, 

20 freeze-dried preparations and the like. A preferred IL4Ra genome anthology of the invention comprises 
. a set of isogenes defined by the haplotypes shown in Table 5 below. 

An isolated polynucleotide containing a polymorphic variant nucleotide sequence of the invention 
may be operably linked to one or more expression regulatory elements in a recombinant expression vector 
capable of being propagated and expressing the encoded IL4Ra protein in a prokaryotic or a eukaryotic 

25 host cell. Examples of expression regulatory elements which may be used include, but are not limited to, 
the lac system, operator and promoter regions of phage lambda, yeast promoters, and promoters derived 
from vaoQinia virus, adenovirus, retroviruses, or SV40. Other regulatory elements include, but are not 
limited to, appropriate leader sequences, termination codons, polyadenylation signals, and other 
sequences required for the appropriate transcription and subsequent translation of the nucleic acid > 

30 sequence in a given host cell. Of course, the correct combinations of expression regulatory elements will 
depend on the host system used. In addition, it is imderstood that the expression vector contains any 
additional elements necessary for its transfer to and subsequent replication in the host cell. Examples of 
such elements include, but are not limited to, origins of replication and selectable markers. Such 
expression vectors are commercially available or are readily constructed using methods known to those in 

35 the art (e.g., F. Ausubel et al., 1987, in **Current Protocols in Molecular Biology", John Wiley and Sons, 
New York, New York). Host cells which may be used to express the variant IL4Ra sequences of the 
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inv ntion include, but are not limited to, eukaiyotic and manunalian cells, such as animal, plant, insect 
and yeast cells, and piokaiyotic cells, such as £. coli, or algal cells as known in the art. The recombinant 
expression vector may be introduced into the host cell using any method knovm to those in the art 
including, but not limited to, microinjection, electroporation, particle bombardment, tran^sduction, and 
5 transfection using DEAE-dextran, lipofection, or calcium phosphate (see e.g., Sambrook et aL (1989) in 
"Molecular Cloning. A Laboratory Manual", Cold Spring Harbor Press, Plainview, New York). In a 
preferred aspect, eukaryotic expression vectors that function in eukaryotic cells, and preferably 
mammalian cells, are used. Non-limiting examples of such vectors include vaccinia virus vectors, 
adenovirus vectors, herpes virus vectors, and baculovirus transfer vectors. Preferred eukaiyotic cell lines 

10 include COS cells, CHO cells, HeLa cells, NIH/3T3 cells, and embryonic stem cells (Thomson, J. A. et 
al., 1998 Science 282:1 145-1 147). Particularly preferred host cells are mammalian cells. 

As will be readily recognized by the skilled artisan, expression of polymorphic variants of the 
IL4Ra gene will produce IL4Rd mRNAs varying from each other at any polymorphic site retained in the 
spliced and processed mRNA molecules. These mRNAs can be used for the prqiaration of an IL4Ra 

15 cDNA comprising a nucleotide sequence which is a polymorphic variant of the IL4Ra reference coding 
sequence shown in Figure 2. Thus, the invention also provides IL4Ra mRNAs and conesponding 
cDNAs which comprise a nucleotide sequence that is identical to S£Q ID NO:2 (Fig.' 2), or its 
corresponding RNA sequence, except for having one or more polymorphisms selected from the group 
consisting of thymine at a position corresponding to nucleotide 237, adenine at a position corresponding 

20 to nucleotide 244, cj^osine at a position corresponding to nucleotide 29 1 , thymine at a position 

corresponding to nucleotide 501 , adenine at a position corresponding to nucleotide 554, cytosine at a 
position corresponding to nucleotide 939, thymine at a position corresponding to nucleotide 1242, 
thymine at a position corresponding to nucleotide 1293, cytosine at a position corresponding to nucleotide 
1299, thymine at a position corresponding to nucleotide 1 70 1 , adenine at a position corresponding to 

25 nucleotide 1735, thymine at a position corresponding to nucleotide 2023, guamne at a position 

corresponding to nucleotide 2254 and cytosine at a position corresponding to nucleotide 2397, and may 
also comprise one or more additional polymorphisms selected from the group consisting of guanine at a 
- position corresponding to 223, cytosine at a position corresponding to nucleotide 1 1 99, cytosine at a 
position corresponding to 1291, cytosine at a position corresponding to nucleotide 1507 and guanine at a 

30 position corresponding to 1737. Fragments of these variant mRNAs and cDNAs are included in the scope 
of the invention, provided they contain the novel polymorphisms described herein. The invention 
specifically excludes polynucleotides identical to previously identified and characterized IL4Ra cDNAs 
and fragments thereof. Polynucleotides conxprising a variant RNA or DNA sequence may be isolated 
from a biological sample using weU-known molecular biological procedures or may be chemically 

35 synthesized. 

Genomic and cDNA fragments of the invention comprise at least one novel polymorphic site 
identified herein and have a length of at least 10 nucleotides and may range up to the fiiU If^oth nf th«^ 
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gene. Preferably, a fragment according to the present invention is between 100 and 3000 nucleotides in 
length, and more preferably between 200 and 2000 nucleotides in length, and most preferably between 
500 and 1000 nucleotides in length. 

In describing the polymorphic sites identified herein, reference is made to the sense strand of the 
5 gene for convraiience. However, as recognized by the skilled artisan, nucleic acid molecules containing 
the IL4Ra gene may be complementary double stranded molecules and thus reference to a particular site 
on the sense strand refers as well to the corresponding site on the complementary antisense strand. Thus, 
reference may be made to the same polymorphic site on either strand and an oligonucleotide may be 
designed to hybridize specifically to either strand at a target region containing the polymorphic site. 

10 Thus, the invention also includes single-stranded polynucleotides which are complementary to the sense 
strand of the IL4Ra genomic variants described herein. 

Polynucleotides comprising a polymorphic gene variant or fragment may be useftil for 
therapeutic purposes. For example, where a patient could, benefit from expression, or increased 
expression, of a particular IL4Ra protein isoform, an eiqxression vector encoding the isoform may be 

15 administered to the patient. The patient may be one who lacks the IL4Ra isogene encoding that isoform 
or may already have at least one copy of that isogene. 

In other situations, it may be desirable to decrease or block expression of a particular IL4Ra 
isogene. Expression of an IL4Ra isogene may be turned off by transforming a targeted organ, tissue or 
cell population with an expression vector that expresses high levels of untranslatable mRNA for the 

20 isogene. Alternatively, oligonucleotides directed against the regulatory regions (e.g., promoter, introns, 
enhancers, 3 ' untranslated region) of the isogene may block transcription. Oligonucleotides targeting the 
transcription initiation site, e.g., between positions —10 and +10 from the start site are preferred. 
Similarly, inhibition of transcription can be achieved using oligonucleotides that base-pair with region(s) 
of the isogene DNA to form triplex DNA (see e.g.. Gee et al. in Huber, B.R and B.I. Carr, Molecular and 

25 Immunologic Approaches, Futura Publishing Co., Mt. Kisco, N.Y., 1994). Antisense oligonucleotides 
may also be designed to block translation of ]L4Ra mRNA transcribed fix»m a particular isogene. It is 
also contemplated that ribozymes may be designed that can catalyze the specific cleaviage of IL4Ra 
mRNA transcribed from a particular isogene. 

The oligonucleotides may be delivered to a target cell or tissue by expression from a vector 

30 introduced into the cell or tissue in vivo or ex vivo, Altematively, the oligonucleotides may be formulated 
as a pharmaceutical composition for administration to the patient. Oligoribonucleotides and/or 
oligodeoxynucleotides intended for use as antisense oUgonucleotides may be modified to increase 
stabihty and half-life. Possible modifications include, but are not limited to phosphorothioate or 2' O- 
methyl linkages, and the inclusion of nontraditional bases such as inosine and queosine, as well as acetyl-, 

35 methyl-, thio-, and similarly modified forms of adenine, cytosine, guanine, thymine, and uracil which are 
not as easily recognized by endogenous nucleases. 



15 



wo 01/04270 PCT/USOO/19094 

The invention also provides an isolated polypeptide comprising a polymoiphic variant of the 
reference IL4Ra amino acid sequence shown in 3. The location of a variant amino acid in an IL4Ra 
polypq)tide or fiagment of the invention is identified by aligning its sequence against Fig. 3. An IL4Ra 
protein variant of the invention comprises an amino acid sequence identical to SEQ ED NO: 3 except for 

5 having one or more variant amino acids selected from the group consisting of threonine at a position 
corresponding to amino acid 82, histidine at a position corresponding to amino acid 185, isoleucine at a 
position corresponding to amino acid 579, serine at a position corresponding to amino acid 675, and 
alanine at a position corresponding to amino acid 752, and may also comprise one or more additional 
variant amino acids selected from the group consisting of valine at a position corresponding to amino acid 
10 75, alanine at a position corresponding to amino acid 400, arginine at a position correpsonding to amino 
acid 431, proline at a position corresponding to amino acid 503, and arginine at a position corresponding 
to amino acid 576. The invention specifically excludes amino acid sequences identical to those previously 
identified for IL4Ra, including SEQ ID NO: 3, and previously described fragments thereof. IL4Ra 
protein variants included within the invention comprise all amino acid sequences based on SEQ ID NO: 3 

5 and having the combination of amino acid variations described in Table 2 below. In preferred 

embodiments, an IL4Ra protein variant of the invention is encoded by an isogene defined by one of the 
observed haplotypes shown in Table 5. ^ 

20 



25 



30 



35 



16 



wo 01/04270 



PCTAJSOO/19094 



Table2. Novel Polymorphic Variant of IL4Ra 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


762 


54 




A 




E 


R 


p 


R 


1 


P 


A 


55 


, 


A 


R 


E 


R 


p 


R 


1 


S 


s 


56 


1 


A 




E 


R 


p 


R 


1 


s 


A 


67 


p — 






A 




s 




V 


p 


A 




j 

. — 




R 


A 




c 
0 


x*< 


V 


q 


q 
0 






A 


R 


A 


n 


Q 
0 




w 

V 


q 
0 


A 

/A 


en 


j 


A 


R 


A 






n 


1 

1 


p 


q 
0 


\f 1 


j 


A 


R 


A 




0 


n 


1 

1 


p 


A 




j 




R 


A 




c 
0 


n 


1 

1 


q 


q 

0 


uo 


j 


A 


Fx 


A 


c: 


Q 
0 


n 


1 

1 


q 
0 


A 

•/A 


RA 


j 

: — . — 


A 


R 


A 




0 
w 


R 
rx 


V 


p 


A 

r\ 






■ A 


R 


A 
rx 




0 


R 

IX 


w 

V 


q 
0 


q 


00 


— j — 


A 


R 
fx 


A 




Q 


R 
fx 


V 


q 
0 


A 
/A 


fi7 


— j — 


A 


R 


A 




0 


R 
fx 


1 
f 


p 


q 




— j — 


A 


R 


A 


p 


q 
0 


R 
rx 


1 
1 


P 


A 




— j — 


A 


R 


A 




q 
0 


R 

rx 


1 
1 


q 


q 


7n 

f w 


— j — 


A 


R 


A 


p 


q 
0 


0 
rx 




q 
0 


A 
A\ 


71 


— j — 


A 


R 


A 


p 


P 




V 


p 


A 
/A 


f 4^ 


— j — 


A 


R 

rx 


A 


p 


p 


0 


V 


q 




i 0 


— j — 


A 


R 
fx 


A • 


p 


p 

r^ 




w 

V 


Q 
s> 


A 
M 


7A 


— j — 


A 


R 

• rx 


A 


p 


p 


rx 
W 


1 
1 


P 
f 




7«; 


— j — 


A 


R 

rx 


A 


p 


P 
1 




i 

1 


P 


A 

r\ 


7R 


— j — 


A 


R 
fx 


A 


p 


p 




1 

1 


q 
0 


q 


f # 


— j — 


A 
r\ 


R 

fx 


A 




P 
1 


r\ 
W 


1 
1 


q 


A 

M 


7R 


— j — 


A 


R 

IX 


A 


p 


p 


R 

fx. 


V . 


p 


A 

/A 


7Q 


— j — 


A 


R 

rx 


A 


P 


P 

r 


R 

IX 


V 


q 


q 




— j — 

— . — 


A 


R 
fx 


A 


p 


P 


R 
rx 


w 

V 


q 
0 


A 

/A 


R1 




A 


R 
rx 


A 


p 


P 

r 


D 
IX 


1 
1 


p 




R9 
0^ 


— j — 


A 


R 
rx 


A 

/A 


p 


P 


R 

rx 


1 

1 


p 


A 




- — j — 


A 


R 
rx 


A 


p 


P ' 


R 

IX 


1 ' 

1 


q 
0 


q 
0 




— j — 


A 


R 

rx 


A 


p 


p 


R 
rx 


1 

1 


q 


A 

/A 




— j — 


A 


R 
rx 


A 


R 
rx 


q 


n 


w 

V 


P 


A 
AA 




— j — 




R 


/\ 


R 


s 


0 


w 

V 


q 


q 
0 


87 


— ^ — 


A 


R 


A 


R 


s 


VK 


v 


S 


A 


88 


— j — 




R 


A 


R 


s 




1 


P 


e 


89 


— j — 


A 


R 


A 


R 


s 


0 


1 


P 


A 


90 


— j — 


A 


R 


A 


R 


s 


Q 


1 


s 


s 


91 




A 


R 


A 


R 


s 


Q 


1 




A 


92 


1 


A 


R 


A 


R 


s 


R 


V 


p 


A 


93 


J ■ 


A 


R 


A 


R 


s 


R 


V 


s 


S 


94 




A 


R 


A 


R 


s 


R 


V 


s 


A 


95 




A 


R 


A 


R 


s 


R 


1 


p 


S 


96 




A 


R 


A 


R 


s 


R 


1 


p 


A 


97 




A 


R 


A 


R 


s 


R 


1 


s 


S 


98 




A 


R 


A 


R 


s 


R 


1 


s 


A 


99 




A 


R 


A 


R 


p 


Q 


V 


p 


A 


100 




A 


R 


A 


R 


p 


Q 


V 


s 


S 


101 




A 


R 


A 


R 


p 


Q 


V 


s 


A 


102 




A 


R 


A 


R 


R 


Q 


1 


p 


S 


103 




A 


R 


A 


R 


p 


Q 


1 


p 


A 


104 




A 


R 


A 


R 


p 


Q 


1 


s 


S 


105 




A 


R 


A 


R 


p 


Q 


1 


s 


A 


106 




A 


R 


A 


R 


p 


R 


V 


p 


A 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Numt>er 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


107 


1 


A 


R 


A 


R 


P 


R 


V 


S 


S 


10& 


\ 


A 


R 


A 


R 


P 


R 


V 


S 


A 


109 


1 


A 


R 


A 


R 


P 


R 


1 


P 


S 


110 


1 


A 


R 


A 


R 


P 


R 


1 


P 


A 


111 


1 


A 


R 


A 


R 


P 


R 


1 


s ^ 


S 


112 


1 ■ 


A 


R 


A 


R 


P 


R 


1 


s 


A 


113 


1 


A 


H 


E 


C 


S 


Q 


V 


p 


S 


114 


1 


A 


H 


E 


C 


s 


Q 


V 


p 


A 


115 


1 


A 


H 


E 


c 


s 


Q 


V 


s 


S 


116 


1 


A 


H 


E 


c 


s 


Q 


V 


s 


A 


117 


1 


A 


H 


E 


c 


s 


Q 


1 


p 


S 


118 


1 


A 


H 


E 


c 


s 


Q 


1 


p 


A 


119 


1 


A 


H 


E 


c 


s 


Q 


1 


s 


S 


120 


1 


A 


H 


E 


c 


s 


Q 


1 


s 


A 


121 


1 


A 


H 


E 


c 


s 


R 


V 


p 


S 


122 


j 


A 


H 


E 


c 


s 


R 


V 


p 


A 


123 


1 


A 


H 


E 


c 


s 


R 


V 


s 


S 


124 


1 


A 


H 


E 


c 


s 


R 


V 


s 


A 


125 


1 


A 


H 


E 


c 


s 


R 


1 


p 


S 


126 


1 


A 


H 


E 


c 


s 


R 


1 


p 


A 


127 


1 


A 


H 


E 


c 


s 


R 


1 


s 


S 


128 


1 


A 


H 


E 


c 


s 


R 


1 


s 


A 


129 


1 


A 


H 


E 


c 


p 


Q 


V 


p 


S 


130 


1 


A 


H 


E 


c 


p 


Q 


V 


p 


A 


131 


1 


A 


H 


E 


c 


p 


Q 


V 


s 


S 


132 


1 


A 


H 


E 


c 


p 


Q 


V 


s . 


A 


133 


1 


A 


H 


E 


c 


p 


Q 


1 


p 


S 


134 


1 


A 


H 


E 


c 


p 


Q 


1 


p 


A 


135 


1 


A 


H 


E 


c 


p 


Q 


1 


s 


S 


136 


1 


A 


H 


E 


c 


p 


Q 


1 


s 


A 


137 


1 


A 


H 


E 


c 


p 


R 


V 


p 


S 


138 


1 


A 


H 


E 


c 


p 


R 


V 


p 


A 


139 


1 


A 


H 


• E 


c 


p 


R 


V 


s 


S 


140 


1 


A 


H 


E 


c 


p 


R 


V 


s 


A 


141 


1 


A 


H 


E 


c 


p 


R 


1 


p 


S 


142 


1 


A 


H 


E 


c 


p 


R 


1 


p 


A 


143 


1 


A 


H 


E 


c 


p 


R 


1 


s 


S 


144 , 


1 


A 


H 


E 


c 


p 


R 


1 


s 


A 


145 


1 


A 


H 


E 


R 


s 


Q 


V 


p 


S 


146 


1 


A 


H 


E 


R 


s 


Q 


V 


p 


A 


147 


1 


A 


H 


E 


R 


s 


Q 


V 


s 


S 


148 




A 


H 


E 


R 


s 


Q 


V 


s 


A 


149 




A 


H 


E 


R 


s 


Q 


1 


p 


- S 


150 




A 


H 


E 


R 


s 


Q 


1 


p 


A 


151 




A 


H 


E 


R 


s 


Q 


1 


s 


S 


152 




A 


H 


E 


R 


s 


Q 


1 


s 


A 


153 




A 


H 


E 


R 


s 


R 


V 


p 


S 


154 




A 


H 


E 


R 


s 


R 


V 


p 


A 


155 




A 


H 


E 


R 


s 


R 


V 


s 


S 


156 




A 


H 


E 


R 


s 


R 


V 


s 


A 


157 




A 


H 


E 


R 


s 


R 


1 


p 


S 


158 




A 


H 


E 


R 


s 


R 


1 


p 


A 


159 




A 


H 


E 


R 


s 


R 


1 


s 


S 
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Tabie2. Novel Polymorphic Variant of IL4Ra (conf d) 



Polymorphtic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


679 


675 


752 


160 


1 


A 


H 


E 


R 


S 


R 


1 


S 


A 


161 


1 


A 


H 


E 


R 


P 


Q 


V 


p 


S 


162 


1 


A 


H 


E 


R 


P 


Q 


V 


p 


A 


163 


1 ' 


A 


H. 


E 


R 


P 


Q 


V 


s 


S 


164 


1 


A . 


H 


E 


R 


P 


Q 


V 


s 


A 


165 


1 


A 


H 


E 


R 


P 


Q 


1 


p 


S 


166 


1 


A 


H 


E 


R 


P 


Q 


1 


p 


A 


167 


1 


A 


H 


E 


R 


P 


Q 


1 


s 


S 


168 


1 


A 


H 


E 


R 


P 


Q 


1 


s 


A 


169 


1 


A 


H 


E 


R 


P 


R 


V 


p 


S 


170 


1 


A 


H 


E 


R 


P 


R 


V 


p 


A 


171. 


1 


A 


H 


E 


R 


P 


R 


V 


s 


S 


172 


1 


A 


H 


E 


R 


P 


R 


V 


s 


A 


173 


1 


A 


H 


E 


R 


P 


R 


1 


p 


S 


174 


1 


A 


H 


E 


R 


P 


R 


1 


p 


A 


175 


1 


A 


H 


E 


R 


P 


R 


1 


s 


S 


176 


1 


A 


H 


E 


R 


P 


R 


1 


s 


A 


177 


1 


A 


H 


A 


C 


s 


Q 


V 


p. 


S 


178 


1 


A 


H 


A 


C 


s 


Q 


V 


p 


A 


179 


1 


A 


H 


A 


C 


s 


Q 


V 


s 


S 


180 


1 


A 


H 


A 


C 


s 


Q 


V 


s 


A 


181 


1 


A 


H 


A 


C 


s 


Q 


1 


p 


S 


182 


1 


A 


H 


A 


C 


s 


Q 


1 


p 


A 


183 


1 


A 


H 


A 


C 


s 


Q 


1 


s . 


S 


184 


1 


A 


H 


A 


C 


s 


Q 


1 


s 


A 


185 


1 


A 


H 


A 


c 


s 


R 


V 


p 


S 


186 


1 


A 


H 


A 


c 


s 


R 


V 


. p 


A 


187 


1 


A 


H 


A 


c 


s 


R 


V 


s 


S 


188 


1 


A 


H 


A 


c 


s 


R 


V 


s 


A 


189 


1 


A 


H 


A 


c 


s 


R 


1 


p 


S 


190 


i 


A 


H 


A 


c 


s 


R 


1 


p . 


A 


191 


1 


A 


H 


A 


c 


s 


R 


1 


s 


S 


192 


1 


A 


H 


A 


c 


. s 


R 


1 


s 


A 


193 


1 


A 


H 


A 


c 


p 


Q 


V 


p 


S 


194 


I 


A 


H 


A 


c 


p 


Q 


V 


p 


A 


195 


1 


A 


H 


A 


c . 


p 


Q 


V 


s 


S 


196 


1 


A 


H 


A 


c 


p 


Q 


V 


s 


A 


197 


1 


A 


H 


A 


c 


p 


Q 


1 . 


p 


S 


198 


1 


A 


H 


A 


c 


p 


Q 


1 


p 


A 


199 


1 


A 


H 


A 


c 


p 


Q 


1 


s 


S 


200 


1 


A 


H 


A 


c 


p 


Q 


1 


s 


A 


201 




A 


H 


A 


c 


p 


R 


V 


p 


S 


202 




A 


H 


A 


c 


p 


R 


V 


p 


A 


203 




A 


H 


A 


c 


p 


R 


V 


s 


S 


204 




A 


H 


A 


c 


p 


R 


V 


s 


A 


205 




A 


H 


A 


c 


p 


R 


1 


p 


S 


206 




A 


H 


A 


c 


p 


R 


1 


p 


A 


207 




A 


H 


A 


c 


.p 


R 


1 


s 


S 


208 




A 


H 


A 


c 


p 


R 


1 


s 


A 


209 




A 


H 


A 


R 


s 


Q 


V 


p 


S 


210 




A 


H 


A 


R 


s 


Q 


V 


p 


A 


211 




A 


H 


A 


R 


s 


Q 


V 


s 


S 


212 




A 


H 


A 


R 


s 


Q 


V 


s 


A 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Id ntities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


213 


1 


A 


H 


A 


R 


s 


Q 


1 


P 


S 


214 


1 


A 


H 


A 


R 


s 


Q 


1 


P 


A 


215 


1 


A 


H 


A 


R 


s 


Q 


1 


s 


S 


216 


1 


A 


H 


A 


R 


s 


Q 


1 


s 


A 


217 


1 


A 


H 


A 


R 


s 


R 


V 


P . 


s 


218 


1 


A 


H 


A 


R 


s 


R 


V 


p 


A 


219 


1 


A 


H 


A 


R 


s 


R 


V 


s 


S 


220 


1 


A 


H 


A 


R 


s 


R 


V 


s 


A 


221 


1 


A 


H 


A 


R 


s 


R 


1 


p 


S 


222 


1 


A 


H 


A 


R 


s 


R 


1 


p 


A 


223 


1 


A 


H 


A 


R 


s 


R 


1 


s 


S 


224 


1 


A 


H 


A 


R 


s 


R 


1 


s 


A 


225 


1 


A 


H 


A 


R 


P 


Q 


V 


p 


S 


226. 


1 


A 


H 


A 


R 


p 


Q 


V 


p 


A 


227 


1 


A 


H 


A 


R 


p 


Q 


V 


s 


S 


228 


1 


A 


H 


A 


R 


p 


Q 


V 


s 


A 


229 


1 


A 


H 


A 


R 


p 


Q 


1 


p 


S 


230 


1 


A 


H 


A 


R 


p 


Q 


1 


p 


A 


231 




A 


H 


A 


R 


p 


Q 


1 


s 


S 


232 


1 


A 


H 


A 


R 


p 


Q 


1 


s 


A 


233 


1 


A 


H 


A 


R 


p 


R 


V 


p 


S 


234 


1 


A 


H 


A 


R 


p 


R 


V 


p 


A 


235 


1 


A 


H 


A 


R 


p 


R 


V 


s 


S 


236 


1 


A 


H 


A 


' R 


p 


R 


V 


s 


A 


237 


1 


A 


H 


A 


R 


p 


R 


I 


p 


S 


238 


1 


A 


H 


A 


R 


p 


R 


1 


p 


A 


239 


1 


A 


H 


A 


R 


p 


R 


1 


s 


S 


240 


1 


A 


H 


A 


R 


p 


R 


1 


s 


A 


241 


1 


T 


R 


E 


C 


s 


Q 


V 


p 


S 


242 


1 


T 


R 


E 


C 


s 


Q 


V 


p 


A 


243 


1 


T 


R 


E 


C 


s 


Q 


V 


s 


S 


244 


1 


T 


R 


E 


C 


s 


Q 


V 


s 


A 


245 


j 


T 


R 


E 


C 


s 


Q 


1 


p 


S 


246 


1 


T 


R 


E • 


C 


s 


Q 


1 


p 


A 


247 


1 


T 


R 


E 


C 


s 


Q 


1 


s 


S 


248 


1 


T 


R 


E 


c 


s 


Q 


1 


s 


A 


249 


1 


T 


R 


E 


c 


s 


R 


V 


p 


S 


250 


1 


T 


R - 


E 


c 


s 


R 


V 


p 


A 


251 


1 


T 


R 


E 


c 


s 


R 


V 


s 


S 


252 


I 


T 


R 


E 


c 


s 


R 


V 


s 


A 


253 


1 


T 


R 


E 


c 


s 


R 


1 


p 


S 


254 




T 


R 


E 


c 


s 


R 


1 


p 


A 


255 




T 


R 


E 


c 


s 


R 


1 


s 


S 


256 




T 


R 


E 


c 


s 


R 


1 


s 


A 


257 




T 


R 


E 


c 


p 


Q 


V 


p 


S 


258 




T 


R 


E 


c . 


p 


Q 


V 


p 


A 


259 




T 


R 


E 


c 


p 


Q 


V 


s 


S 


260 




T 


R 


E 


c 


p 


Q 


V 


s 


A 


261 




T 


R 


E 


c 


p 


Q 


• 1 


p 


S 


262 




T 


R 


E 


c 


p 


Q 


1 


p 


A 


263 




T 


R 


E 


c 


p 


Q 


1 


s 


S 


264 




. T 


R 


E 


c 


p 


Q 


1 


s 


A 


265 




T 


R 


E 


c 


p 


R 


V 


p 


S 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 . 


82 


185 


400 


431 


503 


576 


579 


675 


752 


266 


1 


T 


R 


E 


C 


P 


R 


V 


P 


A 


267 


1 


T 


R 


E 


C 


P 


R 


V 


s 


S 


268 


I 


T . 


R 


E 


C 


P 


R 


V 


s 


A 


269 


1 


T 


R 


E 


C 


P 


R 


I 


P 


S 


270 


1 


T 


R 


E 


C 


P 


R 


1 


p 


A 


271 


1 


T 


R 


E 


C 


P 


R 


1 


s 


S 


272 


1 


T 


R 


E 


C 


P 


R 


1 


s 


A 


273 


1 


T 


R 


E 


R 


S 


Q 


V 


p 


S 


274 


1 


T 


R 


E 


R 


S 


Q 


V 


p 


A 


275 


1 


T 


R 


E 


R 


S 


Q 


V 


s 


S 


276 


1 


T 


R 


E 


R 


S 


Q 


V 


s 


A 


277 


j 


T 


R 


E 


R 


S 


Q 


1 


p 


S 


278 


1 


T 


R 


E 


R 


S 


Q 


1 


p 


A 


279 


I 


T 


R 


E 


R 


S 


Q 


1 


s 


S 


280 


1 


T 


R 


E 


R 


S 


Q 


1 


s 


A 


281 


1 


T 


R 


E 


R 


S 


R 


V 


p 


S 


282 


1 


T 


R 


E 


R 


S 


R 


V 


p 


A 


283 


1 


T 


R 


E 


R 


S 


R 


V 


s 


S 


284 


1 


T. 


R 


E 


R 


S 


R 


V 


s 


A 


285 


1 


T 


R 


E 


R 


s 


R 


1 


p 


S 


286 


1 


T 


R 


E 


R 


s 


R. 


1 


p 


A 


287 


1 


T 


R 


E 


R 


s 


R 


1 


s 


S 


288 


1 


T 


R 


E 


R 


s 


R 


1 


s 


A 


289 


1 


T 


R 


E 


R 


p 


Q 


V 


p 


S 


290 


1 


T 


R 


E 


R 


p 


Q 


V 


p 


A 


291 


1 


T 


R 


E 


R 


p 


Q 


V 


s 


S 


292 


1 


T 


R 


E 


R 


p 


Q 


V 


s 


A 


293 


1 


T 


R 


E 


R 


p 


Q 


1 


p 


S 


294 


1 


T 


R 


E 


R 


p 


Q 


1 


p 


A 


295 


1 


T 


R 


E 


R 


p. 


Q 


1 


s 


S 


296 


1 


T 


R 


E 


R 


p 


Q 


1 


s 


A 


297 


1 


T 


R 


♦ E 


R 


p 


R 


V 


p 


S 


298 


1 


T 


R 


E 


R 


p 


R 


V 


p 


A 


299 


1 


T 


R 


E 


R 


p 


R 


V 


s 


S 


300 


1 


T 


R 


E 


R 


p 


R 


V 


s 


A 


301 


1 


T 


R 


E 


R 


p 


R 


1 


p 


S 


302 


1 


T 


R 


E 


R 


p 


R 


1 


p 


A 


303 


1 


T 


R 


E 


R 


p 


R 


1 


s 


S 


304 


1 


T 


R 


E 


R 


p 


R 


1 


s 


A 


305 


1 


T 


R 


A 


C 


s 


Q 


V 


p 


S 


315 


1 


T 


R 


A 


C 


s 


R 


V 


s 


S 


316 




T 


R 


A 


C 


s 


R 


V 


s 


A 


317 




T 


R 


A 


C 


s 


R 


1 


p 


S 


318 




T 


R 


A 


C 


s 


R 


1 


p 


A 


319 




T 


R 


A 


C 


s 


R 


1 


s 


S 


320 




T 


R 


A 


C 


s 


R 


1 


s 


A 


321 




T 


R 


A 


C 


p 


Q 


V 


p 


S 


322 




T 


R 


A 


C 


p 


Q 


V 


p 


A 


323 




' T 


R 


A 


C 


p 


Q 


V 


s 


S 


324 




T 


R 


A 


C 


p 


Q 


V 


s 


A 


325 




T 


R 


A 


C 


p 


Q 


1 


p 


S 


326 




T 


R 


A 


C 


p 


Q 


1 


p 


A 


327 




T 


R 


A 


C 


p 


Q 


1 


s 


S 
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Table2. Novel Polymorphic Variant of IL4Ra (conf d) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


328 


1 


T 


R 


A 


C 


P 


Q 


1 


S 


A 


306 


1 


T 


R 


A 


C 


S 


Q 


V 


P 


A 


307 


1 


T 


R 


A 


C 


S 


Q 


V 


s 


S 


308 


1 


T 


R 


A 


C 


S 


Q 


V 


s 


A 


309 


1 




R 


A 


C 


S 


Q 


1 


p 


S 


310 


1 


T 


R 


A 


C 


s 


Q 


1 


p 


A 


311 


1 


T 


R 


A 


C 


s 


Q 


1 


s 


S 


312 


1 


T 


R 


A 


C 


s 


Q 


1 


s 


A 


313 


1 


T 


R 


A 


C 


s 


R 


V 


p 


S 


314 


i 


T 


R 


A 


C 


s 


R 


V 


p 


A 


329 


1 


T 


R 


A 


C 


p 


R 


V 


p 


S 


330 


1 


T 


R 


A 


C 


p 


R 


V 


p 


A 


331 


1 


T 


R 


A 


C 


p 


R 


V 


s 


S 


332 


1 


T 


R 


A 


C 


. p 


R 


V 


s 


A 


333 


1 


T 


R 


A 


C 


p 


R 


1 


p 


S 


334 


1 


T 


R 


A 


C 


p 


R 


1 


p 


A 


335 


1 


T 


R 


A 


c 


p 


R 


1 


s 


S 


336 


1 


T 


R 


A 


c 


p 


R 


1 


s 


A 


337 


1 


T 


R 


A 


R 


s 


Q 


V 


p 


S 


338 


1 


T 


R 


A 


R 


s 


Q 


V 


p 


A 


339 


1 


T 


R 


A 


R . 


s 


Q 


V 


s 


S 


340 


1 


T 


R 


A 


R 


s 


Q 


V 


s 


A 


341 


1 


T 


R 


A 


R 


s 


Q 


1 


p 


S 


342 


1 


T 


R 


A 


R 


s 


Q 


1 


p 


A 


343 


1 • 


T 


R 


A 


R 


s 


Q 


1 


s 


S 


344 


1 


T 


R 


A 


R 


s 


Q 


1 


s 


A 


345 


1 


T 


R 


A 


R 


s 


R 


V 


R 


S 


346 


1 


T 


R 


A 


R 


s 


R 


V 


p 


A 


347 


1 


T 


R 


A 


R 


s 


R 


V 


s 


S 


348 


1 


T 


R 


A 


R 


s 


R 


V 


s 


A 


349 


• 1 


T 


R 


A 


R 


s 


R 


1 


p 


S 


350 


1 


T 


R 


A 


R 


s 


R 


1 


p 


A 


351 


1 


T 


R 


A 


R 


s 


R 


1 


s 


S 


352 


1 


T 


R 


A 


R 


s 


R 


1 


s 


A 


353 


1 


T 


R 


A 


R 


p 


Q 


V 


p 


S 


354 


1 


T 


R 


A 


R 


p 


Q 


V 


p 


A 


355 


1 


T 


R 


A 


R 


p 


Q 


V 


s 


S 


356 


1 


T 


R 


A 


R 


p 


Q 


V 


s 


A 


357 


1 


T 


R 


A 


R 


p 


Q 


1 


p 


S 


358 


1 


T 


R 


A 


R 


p 


Q 


1 


p 


A 


359 


1 


T 


R 


A 


R 


p 


Q 


1 


s 


S 


360 




T 


R 


A 


R 


p 


Q 


1 


s 


A 


361 




T 


. R 


A 


R 


p 


R 


V 


p 


S 


362 




T 


R 


A 


R 


p 


R 


V 


p 


A 


363 




T 


R 


A 


R 


p 


R 


V 


s 


S 


364 




T 


R 


A 


R 


p 


R 


V 


s 


A 


365 




T 


R 


A 


R 


p 


R 


1 


p 


S 


366 




T 


R 


A 


R 


p 


R 


1 


p 


A 


367 




T 


R 


A 


R 


p 


R 


1 


s 


S 


368 




T 


R 


A 


R 


p 


R 


1 


s 


A 


369 




T 


H 


E 


C 


s 


Q 


V 


p 


S 


370 




T 


H 


E 


C 


s 


Q 


V 


p 


A 


371 




T 


H 


E 


C 


s 


Q 


V 


s 


S 
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Table2. Novel Poiymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


679 


675 


752 


372 


1 


T 


H 


E 


C 


S 


Q 


V 


S 


A 


373 


1 


T 


H 


E 


C 


S 


Q 


1 


P 


S 


374 


1 


T 


H 


E 


c 


S 


Q 


1 


P 


A 


375 


1 


T 


H 


E 


c 


S 


Q 


1 


S 


S 


376 


1 


T 


H 


E 


c 


S 


Q 


1 


S 


A 


377 


1 


T 


H 


E 


c 


s 


R 


V 


P 


S 


378 


1 


T 


H 


E 


c 


s 


R 


V 


P 


A 


379 


1 


T 


H 


E 


c 


s 


R 


V 


s 


S 


380 


1 


T 


H 


E 


c 


s 


R 


V 


s 


A 


381 


1 


T 


H 


E 


c 


s 


R 


1 


p 


S 


382 


1 


T 


H 


E 


c 


s . 


R 


1 


p 


A 


383 


1 


T 


H 


E 


c 


s 


R 


1 


s 


S 


384 


1 


T 


H 


E 


c 


s 


R 


1 


s 


A 


385 


1 


T 


H 


E 


c 


p 


Q 


V 


p 


S 


386 


1 


T 


H 


E 


c 


p 


Q 


V 


p 


A 


387 


- 1 


T 


H 


E 


c 


p 


Q 


V 


s 


S 


388 


1 


T 


H 


E 


c 


p 


Q 


V 


s 


A 


389 


1 


T 


H 


E 


c 


p 


Q 


1 


p 


S 


390 


1 


T 


H 


E 


c 


p 


Q 


1 


p 


A 


391 


1 


T • 


H 


E 


c 


p 


Q 


1 


s 


S 


.392 


1 


T 


H 


E 


c 


p 


Q 


1 


s 


A 


393 


1 


T 


H 


E 


c 


p 


R 


V 


p 


S 


394 


1 


T 


H 


E 


c 


p 


R 


V 


• p 


A 


395 




T 


H 


E 


c 


p 


R 


V 


s 


S 


396 


1 


T 


H 


E 


c 


p 


R 


V 


s 


A 


397 


1 


T 


H 


E 


c 


p 


R 


1 


p 


S 


398 


1 


T 


H 


E 


c 


p 


R 


1 


p 


A 


399 


1 


T 


H 


E 


c 


p 


R 


1 


s 


S 


400 


1 


T 


H 


E 


c 


p 


R 


1 


s 


A 


401 


.| - 


T 


H 


E 


R 


s 


Q - 


V 


p 


S 


402 


1 


T 


H 


E 


R 


s 


Q 


V 


p 


A 


403 


1 


T 


H 


E 


R 


s 


Q 


V 


s 


S 


404 


1 


T 


H 


E 


R 


s 


Q 


V 


s 


A 


405 


1 


T 


H 


E 


R 


s 


Q 


1 


p 


S 


406 


1 


T 


H 


E 


R 


s 


Q 


1 


p 


A 


407 


1 


T 


H 


E 


R 


s 


Q 


1 


s 


S 


408 


1 


T 


H 


E 


R 


s 


Q 


1 


s 


A 


409 


1 


T 


H 


E 


R 


s 


R 


V 


p 


S 


410 


1 


T 


H 


E 


R 


s 


R 


V 


p 


A 


411 


1 


T 


H 


E 


R 


s 


R 


V 


s 


S 


412 


1 


T 


H 


E 


R 


s 


R 


V 


s 


A 


413 




T 


H 


E 


R 


s 


R 


1 


p 


S 


414 




T 


H 


E 


R 


s 


R 


1 


p 


A 


415 




T 


H 


E 


R 


s 


R 


1 


s 


S 


416 




T 


H 


E 


R 


s 


R 


1 


s 


A 


417 




T 


H 


E 


R 


p 


Q 


V 


p 


S 


418 




T 


H 


E 


R 


p 


Q 


V 


p 


A 


419 




T 


H 


E 


R 


p 


Q 


V 


s 


S 


420 




T 


H 


E 


R 


p 


Q 


V 


s 


A 


421 




T 


H 


E 


R 


p 


Q 


1 


p 


S 


422 




T 


H 


E 


R 


p 


Q 


1 


p 


A 


423 




T 


H 


E 


R 


p 


Q 


1 


s 


S 


424 




T 


H 


E 


R 


p 


Q 


1 


s 


A 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


425 


1 


T 


H 


E 


R 


P 


R 


V 


P 


S 


426 


1 


T 


H 


E 


R 


P 


R 


V 


P 


A 


427 


1 


T 


H 


E 


R 


P 


R 


V 


S 


S 


428 


1 


T 


H 


E 


R 


P 


R 


. V 


S 


A 


429 


1 


T 


H 


E 


R 


P 


R 


1 


P 


S 


430 


1 


T 


H 


E 


R 


P 


R 


1 


P 


A 


431 - 


1 


T 


H 


E 


R , 


P 


R 


1 


S 


S 


432 


1 


T 


H 


E 


R 


P 


. R 


1 


s 


A 


433 


1 


T 


H 


A 


C 


S 


Q 


V 


p 


S 


434 


1 


T 


H 


A 


C 


s 


Q 


V 


p 


A 


435 


1 


T 


H 


A 


C 


s 


Q 


V 


s 


S 


436 


1 


. T 


H 


A 


C 


s 


Q 


V 


s 


A 


437 


1 


T 


H 


A 


c 


s 


Q 


1 


p 


S 


438 


1 


T 


H 


A 


c 


s 


Q 


1 


p 


A 


439 


1 


T 


H 


A 


c 


s 


Q 


1 


s 


S 


440 


1 


T 


H 


A 


c 


s 


Q 


1 


s 


A 


441 


1 


T 


H 


A 


c 


s 


R 


V 


p 


S 


442 


1 


T 


H 


A 


c 


s 


R 


V 


p 


A 


443 


1 


T 


H 


A 


c 


s 


R 


V 


s 


S 


444 


1 


T 


H 


A 


c 


s 


R 


V 


s 


A 


445 - 


f 


T 


H 


A 


c 


s 


R 


1 


p 


S 


446 


1 


T 


H 


A 


c 


s 


R 


1 


p 


A 


447 


1 


T 


H 


A 


c 


s 


R 


1 


s . 


S 


448 


1 


T 


H 


A 


c 


s 


R 


1 


s 


A 


449 


1 . 


T 


H 


A 


c 


P 


Q 


V 


p 


S 


450 


1 


T 


H 


A 


- c 


p 


Q 


V 


p 


A 


451 


1 


T 


H 


A 


c 


p 


Q 


V 


s 


S 


452 


1 


T 


H 


A 


c 


p 


Q 


V 


s 


A 


453 


1 


T 


H 


A 


c 


p 


Q 


1 


p 


S 


454 


1 


T 


H 


A 


c 


p 


Q 


1 


p 


A 


455 


1 


T 


H 


A 


c 


p 


Q 


1 


s 


S 


456 


1 


T 


H 


A 


c 


p 


Q 


1 


s 


A 


457 


1 


T 


H 


A 


c 


p 


R 


V 


p 


S 


458 


1 


T 


H 


A 


c 


p 


R 


V 


p 


A 


459 


1 


T 


H 


A 


c 


p 


R 


V 


s 


S 


460 


1 


T 


H 


A 


c 


p 


R 


V 


s 


A 


461 


1 


T 


H 


. A 


c 


p 


R 


1 


p 


S 


462 


1 


T 


H 


A 


c 


p 


R 


1 


p 


A 


463 


1 


T 


H 


A 


c 


p 


R 


1 


s 


S 


464 


1 


T 


H 


A 


c 


p 


R 


1 


s 


A 


465 


1 


T 


H 


A 


R 


s 


Q 


V 


p 


S 


466 




T 


H 


A 


R 


s 


Q 


V 


p 


A 


467 




T 


H 


A 


R 


s 


Q 


V 


s 


S 


468 




T 


H 


A 


R 


s 


Q 


V 


s 


A 


469 




T 


H 


A 


R 


s 


Q 


1 


p 


S 


470 




T 


H 


A 


R 


s 


Q 




p 


A 


471 




T 


H 


A 


R 


s 


Q 


1 


s 


S 


472 




T 


H 


A 


R 


s 


Q 


1 


s 


A 


473 




T 


H 


A 


R 


s 


R 


V 


p 


S 


474. 




T 


H 


A 


R 


s 


R 


V 


p. 


A. 


475 




T 


H 


A 


R 


s 


R 


V 


s 


S 


476 




T 


H 


A 


R 


s 


R 


V 


s 


A 


477 




T 


H 


A 


R 


s 


R 


1 


p 


S 
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Tabie2. Novel Polymorphic Variant of IL4Ra (conf d) 



Polymorpliic 
Variant 



Amino Acid Position and identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


478 


i 


T 


H 


A 


R 


S - 


R 


i 


P 


A 


479 


1 


T 


H 


A 


R 


S 


R 


1 


S 


S 


480 


1 


T 


H 


A 


R 


S 


R 


1 


S 


A 


481 


1 


T 


H 


A 


R 


P 


Q 


V 


P 


S 


482 


1 


T 


H 


A 


R 


P 


Q 


V 


P 


A 


483 


1 


T 


H 


A 


R 


P 


Q 


V 


s 


S 


484 


1 


T 


H 


A 


R 


P 


Q 


V 


s 


A 


485 


1 


T 


H 


A 


R 


P 


Q 


1 


p 


S 


486 


1 


T 


H 


A 


R 


P 


Q 


1 


p 


A 


487 


1 


T 


H 


A 


R 


P 


Q 


1 


s 


S 


488 


1 


T 


H 


A 


R 


P 


Q 


1 


s 


A 


489 


1 


T 


H 


A 


R 


P 


R 


V 


p 


S 


490 


1 


T 


H 


A 


R 


P 


R 


V 


p 


A 


491 


1 


T 


H 


A 


R 


P 


R 


V 


s 


S 


492 


1 


T 


H 


A 


R 


P 


R 


V 


s 


A 


493 


1 


T 


H 


A 


R 


P 


R 


1 


p 


S 


494 


1 


T 


H 


A 


R 


P 


R 


1 


p 


A 


495 


1 


T 


H 


A 


R 


P 


R 


1 


s 


S 


496 


I 


T 


H 


A 


R 


P 


R 


1 


s 


A 


497 


V 


A 


R 


E 


C 


s 


Q 


V 


p 


A 


498 


V 


A 


R 


E 


C 


s 


Q 


V 


s 


S 


499 


V 


A 


R 


E 


C 


s 


Q 


V 


s 


A 


500 


V 


A 


R 


E 


C 


s 


Q 


1 


p 


S 


501 


V 


A 


R 


E 


c 


s 


Q 


1 


p 


A 


502 


V 


A 


R 


E 


c 


s 


Q 


1 


s 


S 


503 


V 


A 


R 


E 


c 


s 


Q 


1 


s 


A 


504 


V 


A 


R 


E 


c 


s 


R 


V 


p 


A 


505 


V 


A 


R 


E 


c 


s 


R 


V 


s 


S 


506 


V 


A 


R 


E . 


c 


s 


R 


V 


s 


A 


507 


V 


A 


R 


E 


c 


s 


R 


1 


p 


S 


508 


V 


A 


R 


E 


c 


s 


R 


1 


p 


A 


509 


V 


A 


R 


E - 


c 


s 


R 


1 


s 


S 


510 


V 


A 


R 


E 


c 


s 


R 


1 


s 


A 


511 


V 


A 


R 


E 


c 


p 


Q 


V 


p 


A 


512 


V 


A 


R 


E 


c 


p 


Q 


V 


s 


S 


513 


V 


A 


R 


E 


c 


p 


Q 


V 


s 


A 


514 


V 


A 


R 


E 


c 


p 


Q 


1 


p 


S 


516 


V 


A 


R 


E 


c 


p 


Q 


1 


p 


A 


516 


V 


A 


R 


E 


c 


p 


Q 


1 


s 


S 


517 


V 


A 


R 


E 


c 


p 


Q 


1 


s 


A 


518 


V 


A 


R 


E 


c 


p 


R 


V 


p 


A 


519 


V 


A 


R 


E 


c 


p 


R 


V 


s 


S 


520 


V 


A 


R 


E 


c 


p 


R 


V 


s 


A 


521 


V 


A 


R 


E 


c 


p 


R 


i 


p 


S 


522 


V 


A 


R 


E 


c 


p 


R 


1 


p 


A 


523 


V 


A 


R 


E 


c 


p 


R 




s 


S 


524 


V 


A 


R 


E 


c 


p 


R 


1 


s 


A 


525 


V 


A 


R 


E 


R 


s 


Q 


V 


p 


A 


526 


V 


A 


R 


E 


R 


s 


Q 


V 


s 


S 


527 


V 


A 


R 


E 


R 


s 


Q 


V 


s 


A 


528 


V 


A 


R 


E 


R 


s 


Q 


1 


p 


S 


529 


V 


A 


R 


E 


R 


s 


Q 


1 


p 


A 


530 


V 


A 


R 


E 


R 


s 


Q 


1 


s 


S 
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Table2, Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


676 


579 


675 


752 


531 


V 


A 


R 


E 


R 


S 


Q 


1 


S 


A 


532 


V 


A 


R 


E 


R 


S 


R 


V 


p 


A 


533 


V 


A 


R 


E 


R 


S 


R 


V 


s 


S 


534 


V 


A 


R 


E 


R 


S 


R 


V 


s 


A 


535 


V 


A 


R 


E 


R 


S 


R 


1 


p 


S 


536 


V 


A 


R 


E 


R 


s 


R 


1 


p 


• A 


537 . 


V 


A 


R 


E 


R 


s 


R 


1 


s 


S 


538 


V 


A 


R 


E 


R 


s 


R 


1 


s 


A 


539 


V 


A 


R 


E 


R 


p 


Q 


V 


p 


A 


540 


V 


A 


* R 


E 


R 


p 


Q 


V 


s 


S 


541 


V 


A 


R 


E 


R 


p 


Q 


V 


s 


A 


542 


V 


A 


R 


E 


R 


p 


Q 


1 


p 


S 


543 


V 


A 


R 


E 


R 


p 


Q 


1 


p 


A 


544 


V 


A 


R 


E • 


R 


p 


Q 


1 


s 


S 


545 


V 


A 


R 


E 


R 


p 


Q 


1 


s 


A 


546 


V 


A 


R 


E 


R 


p 


R 


V 


p 


A 


547 


V 


A 


R 


E 


R 


p 


R 


V 


s 


S 


548 


V 


A 


R 


E 


R 


p 


R 


V 


s 


A 


549 


V 


A 


R 


E 


R 


p 


R 


1 


p 


S 


550 


V 


A 


R 


E 


R 


p 


R 


1 . 


p 


A 


551 


V 


A 


R 


E 


R 


p 


R 


1 


s 


S 


552 


V 


A 


R 


E 


R 


p 


R 


1 


s 


A 


553 


V 


A 


R 


A 


C 


s 


Q 


V 


p 


A 


554 


V 


A 


R 


A 


C 


s 


Q 


V 


s 


S 


555 


V 


A 


R 


A 


C 


s 


Q 


V 


s 


A 


556 


V 


A 


R 


A 


C 


s 


Q 


1 


p 


S 


557 


V 


A 


R 


A 


C 


s 


Q 


1 . 


p 


A 


558 


V 


A 


R 


A 


C 


s 


Q 


1 


s 


S 


559 


V 


A 


R 


A 


C 


s 


Q 


1 


s 


A 


560 


V 


A 


R 


A 


C 


s 


R 


V 


p 


A 


561 


V 


A 


R 


A 


C 


s 


R 


V 


s 


S 


562 


V 


A 


R 


A 


C 


s 


R 


V 


s 


A 


563 


V 


A 


R 


A 


C 


s 


R 


1 


p 


S 


564 


V 


A 


R 


A 


C 


s 


R 


1 


p 


A 


565 


V 


A 


R 


A 


C 


s 


R 


1 


s 


S 


566 


V 


A 


R 


A 


c 


s 


R 


1 


s 


A 


567 


V 


A 


R 


A 


c 


p 


Q 


V 


p 


A 


568 


V 


A 


R 


A 


c 


p 


Q 


V 


s 


S 


569 


V 


A 


R 


A 


c 


p 


Q 


V 


- s 


A 


570 


V 


A 


R 


A 


c 


p 


Q 


1 


p 


S 


571 


V 


A 


R 


A 


c 


p 


Q 


1 


p 


A 


572 


V 


A 


R 


A 


c 


p 


Q 


1 


s 


S 


573 


V 


A 


R 


A 


c 


p 


Q 


1 


s 


A 


574 


V 


A 


R 


A 


c 


p 


R 


V 


p 


A 


575 


V 


A 


R 


A 


c 


p 


R 


V 


s 


S 


576 


V 


A 


R 


A 


c 


p 


R 


V 


s 


A 


577 


V 


A 


R 


A 


c 


p 


R 


I 


p 


S 


578 


V 


A 


R 


A 


c 


p 


R 


1 


p 


A 


579 


V 


A 


R 


A 


c 


p 


R 


1 


s 


S 


580 


V 


A 


R 


A 


c 


p 


R 


1 


s 


A 


581 


V 


A 


R 


A 


R 


s 


Q 


V 


p 


A 


582 


V 


A 


R 


A 


R 


s 


Q 


V 


s 


S 


583 


V 


A 


R 


A 


R 


s 


Q 


V 


s 


A 
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Table2. Nov I Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


584 


V 


A 


R 


A 


R 


s 


Q 


1 


P 


S 


585 


V 


A 


R 


A 


R 


s 


Q 


1 


P 


A 


586 


V 


A 


R 


A 


R 


s 


Q 


1 


S 


S 


587 


V 


A 


R 


A 


R 


s 


Q. 


1 


S 


A 


588 


V 


A 


R 


A 


R 


s 


R 


V 


P 


A 


589 


V 


A 


R 


A 


R 


s 


R 


V 


s 


S 


590 


V 


A 


R 


A 


R 


s 


R 


V 


s 


A 


5S|1 


V 


A 


R 


A 


R 


s 


R 


1 


p 


S 


592 


V 


A 


R 


A 


R 


s 


R 


1 


p 


A 


593 


V 


A 


R 


A 


R 


s 


R 


1 


s 


S 


594 


V 


A 


R 


A 


R 


s 


R 


1 


s 


A 


595 


V 


A 


R 


A 


R 


p 


Q 


V 


p 


A 


596 


V 


A 


R 


A 


R 


p 


Q 


V 


s 


S 


597 


V 


A 


R 


A 


R 


p 


Q 


V 


s 


A 


. 598 


V 


A 


R 


A 


R 


p 


Q 


1 


p 


S 


599 


V 


A 


R 


A 


R 


p 


Q 


1 


p 


A 


600 


V 


A 


R 


A 


R 


p 


Q 


1 


s 


S 


601 


V 


A 


R 


A 


R 


p 


Q 


1 


s 


A 


602 


V 


A 


R 


A 


R 


p 


R 


V 


p 


A 


603 


V 


A 


R 


A 


R 


p 


R 


V 


s 


S 


604 


V 


A 


R 


A 


R 


p 


R 


V 


s 


A 


605 


V 


A 


R 


A 


R 


p 


R 


1 


p 


S 


606 


V 


A 


R 


A 


R 


p 


R 


1 


p 


A 


607 


V 


A 


R 


A 


R 


p 


R 


1 


s 


S 


608 


V 


A 


R 


A 


R 


p 


R 


1 


s 


A 


609 


V 


A 


H 


E 


C 


s 


Q 


V 


p 


S 


610 


V 


A 


H 


E 


C 


s 


Q 


V 


p 


A 


611 


V 


A 


H 


E 


C 


s 


Q 


V 


s 


S 


612 


V 


A 


H 


E 


C 


s 


Q " 


V 


s 


A 


613 


V 


A 


H 


E 


C 


s 


Q 


1 


p 


S 


614 


V 


A 


H 


E 


C 


s 


Q 


1 


p 


A 


616 . 


V 


A 


H 


E 


c 


s 


Q 


1 


s 


S 


616 


V 


A 


H 


E 


c 


s 


Q 


1 


s 


A 


617 


V 


A 


H 


B 


c 


s 


R 


V 


p 


S 


618 


V 


A 


H 


E 


c 


s 


R 


V 


p 


A 


619 


V 


A 


H 


E 


c 


s 


R 


V 


s 


S 


620 


V 


A 


H 


E 


c 


s 


R 


V 


s 


A 


621. 


V 


A 


. H 


E ■ 


c 


s 


R 


1 


p 


S 


622 


V 


A 


H 


E 


c 


s 


R 


1 


p 


A 


623 


V 


A 


H 


E 


c 


s 


R 


1 


s 


S 


624 


V 


A 


H 


E 


c 


s 


R 


1 


s 


A 


625 


V 


A 


H 


E 


c 


p 


Q 


V 


p 


S 


626 


V 


A 


H 


E 


c 


p 


Q 


V 


p 


A 


627 


V 


A 


H 


E 


c 


p 


Q 


V 


s 


S 


628 


. V 


A 


H 


E 


c 


p 


Q 


V 


s 


A 


629 


V 


A 


H 


• E 


c 


p 


Q 


1 


p 


S 


630 


V 


A 


H 


E 


c 


p 


Q 


1 


p 


A 


631 


V 


A 


H 


E 


c 


p 


Q 


1 


s 


S 


632 


V 


A 


H 


E 


c 


p 


Q 


1 


s 


A 


633 


V 


A 


H 


E 


c 


p 


R 


V 


p 


S 


634 


V 


A 


H 


E 


c 


p 


R 


V 


p 


A 


635 


V 


A 


H 


E 


c 


p 


R 


V 


s 


S 


636 


V 


A 


H 


E 


c 


p 


R 


V 


s 


A 
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Table2> Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


679 


676 


752 


637 


V 


A 


H 


E 


C 


P 


R 


1 


P 


S 


638 


V 


A 


H 


E 


C 


P 


R 


1 


P 


A 


639 


V 


A 


H 


E 


C 


P 


R 


1 


s 


S 


640 


V 


A 


H 


E 


C 


P 


R 


1 


s 


A 


641 


V 


A 


H 


-E 


R 


S 


Q 


V 


p 


S 


642 


V 


A 


H 


E 


R 


s 


Q 


V 


p 


A 


643 


V 


A 


H 


E 


R 


s 


Q 


V 


s 


S 


644 


V 


A 


H 


E 


R 


s 


Q 


V 


s 


A 


645 


V 


A 


H 


. E 


R 


s 


Q 


1 


p 


S 


646 


V 


A 


H 


E 


R 


s 


Q 


1 


p 


A 


647 


V 


A 


H 


E 


R 


s 


Q 


1 


s 


S 


648 


V 


A 


H 


E 


R 


s 


Q 


1 


s 


A 


649 


V 


A 


H 


E 


R 


s 


R 


V 


p 


S 


650 


V 


A 


H 


E 


R 


s 


R 


V 


p 


A 


651 


V 


A 


H 


E 


R 


s 


R 


V 


s 


S 


652 


V 


A 


H 


E 


R 


s 


R 


V 


s 


A 


653 


V 


A 


H 


E 


R 


s 


R 


1 


p 


S 


654 


V 


A 


H 


E 


R 


s 


R 


1 


p 


A 


655 


V 


A 


H 


E 


R 


s 


R 


1 


s 


S 


656 


V 


A 


H 


E 


R 


s 


R 


1 


s 


A 


657 


V 


A 


H 


E 


R 


p 


Q 


V 


p 


S 


658 


V 


A 


H 


E 


R 


p 


Q 


V 


p 


A 


659 


V 


A 


H 


E 


R 


p 


Q 


V 


s 


S 


660 


V 


A 


H 


E 


R 


p 


Q 


V 


s 


A 


661 


V 


. A 


H 


E 


R 


p 


Q 


1 


p 


S 


662 


V 


A 


H 


E- 


R 


p 


Q 


1 


p 


A 


663 


V 


A 


H 


E 


R 


p 


Q 


1 


s 


S 


664 


V 


A 


H 


E 


R 


p 


Q 


. 1 


s 


A 


665 


V 


A 


H 


E 


R 


p 


R 


V 


p 


S 


666 


V 


A 


H 


E 


R 


p 


R 


V 


p 


A 


667 


V 


A 


H 


E 


R 


p 


R 


V 


s 


S 


668 


V 


A 


H 


E 


R 


p 


R 


V 


s 


A 


669 


V 


A 


H 


E 


R 


p 


R 


1 


p 


S 


670 


V- 


A 


H 


E 


R 


p 


R 


1 


p 


A 


671 


V 


A 


H 


E 


R 


p. 


R 


1 


s 


S 


672 


V 


A 


H 


E 


R 


p 


R 


1 


s 


A 


673 


V 


A 


H 


A 


C 


s 


Q 


V 


p 


S 


674 


V 


A 


H 


A 


C 


s 


Q 


V 


p 


A 


675 


V 


A 


H 


A 


C 


s 


Q 


V 


s 


S 


676 


V 


A 


H 


A 


C 


s 


Q 


V 


s 


A 


677 


V 


A 


H 


A 


c 


s 


Q 


1 


p 


S 


678 


V 


A 


H 


A 


c 


s 


Q 


1 


p 


A 


679 


V 


A 


H 


A 


c 


s 


Q 


1 


s 


S 


680 


V 


A 


H 


A 


c 


s 


Q 


1 


s 


A 


681 


V 


A 


H 


A 


c 


s 


R 


V 


p 


S 


682 


V 


A 


H 


A 


c 


s 


R 


V 


p 


A 


683 


V 


A 


H 


A 


c 


s 


R 


V 


s 


S 


684 


V 


A 


H 


A 


c 


s 


R 


V 


s 


A 


685 


V 


A 


H 


A 


c 


s 


R 


1 


p 


S 


686 


V 


A 


H 


A 


c 


s 


R 


1 


p 


A 


687 


V 


A 


H 


A 


c 


s 


R 


1 


s 


S 


688 


V 


A 


H 


A 


c 


s 


R 


1 


s 


A 


689 


V 


A 


H 


A 


c 


p 


Q 


V 


p 


S 
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Table2. Novel Polymorphic Variant of IL4Ra (conrd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


690 


V 


A 


H 


A 


C 


P 


Q 


V 


P 


A 


691 


V 


A 


H 


A 


C 


P 


Q 


V 


s 


S 


692 


V 


A 


H 


A 


C 


P 


Q 


V 


s 


A 


693 


V 


A 


H 


A 


C 


P 


Q 


1 


P 


S 


694 


V 


A 


H 


A 


C 


P 


Q 


1 


p 


A 


695 


V 


A 


H 


A 


C 


P 


Q 


1 


s 


S 


696 


V 


A 


H 


A 


C 


P 


Q 


1 


s 


A 


697 


V 


A 


H 


A 


C 


P 


R 


V 


p 


S 


698 


V 


A 


H 


A 


C 


P 


R 


V 


p 


A 


699 


V 


A 


H 


A 


C 


P 


R 


V 


s 


S 


700 


V 


A 


H 


A 


C 


P 


R 


V 


s 


A 


701 


V 


A 


H 


A 


C 


P 


R 


1 


p 


S 


702 


V 


A 


H 


A 


C 


P 


R 


1 


p 


A 


703 


V 


A 


H 


A 


C 


P 


R 


1 


s 


S 


704 


V 


A 


H 


A 


C 


P 


R 


1 


s 


A 


705 


V 


A 


H 


A 


R 


S 


Q 


V 


p 


S 


706 


V 


A 


H 


A 


R 


s 


Q 


V 


p 


A 


707 


V 


A 


H 


A 


R 


s 


Q 


V 


s 


S 


708 


V 


A 


H 


A 


R 


s 


Q 


V 


s 


A 


709 


V 


A 


H 


A 


R . 


s 


Q 


1 


p 


S 


710 


V 


A 


H 


A 


R 


s 


Q 


1 


p 


A 


711 


V 


A 


H 


A 


R 


s 


Q 


1 


s 


S 


712 


V 


A 


H 


A 


R 


s 


Q 


1 


s 


A 


713 


V 


A 


H 


A 


R 


s 


R 


V 


p 


S 


714 


V 


A 


H 


A 


R 


s 


R 


V 


p 


A 


715 


V 


A , 


H 


A 


R 


s 


R 


V 


s 


S 


716 


V 


A 


H 


A 


R 


s 


R 


V 


s 


A 


717 


V 


A 


H 


A 


R 


s 


R 


1 


p 


S 


718 


V 


A 


H 


A 


R 


s 


R 


1 


p 


A 


719 


V 


A 


H 


A 


R 


s 


R 


1 


s 


S 


720 


V 


A 


H 


A 


R 


s 


R 


1 


s 


A 


721 - 


V 


A 


H 


A 


R 


P 


Q 


V 


p 


S 


722 


V 


A 


H 


A 


R 


P 


Q 


V 


p 


A 


723 


V 


A 


H 


A 


R 


. p 


Q 


V 


s 


S 


724 


V 


A 


H 


A 


R 


p 


Q 


V 


s 


A 


725 


V 


A 


H 


A 


R 


p 


Q 


1 


p 


S 


726 


V 


A 


H 


A 


R 


p 


Q 


1 


p 


A 


.727 


V 


A 


H 


A 


R 


p 


Q 


1 


s 


S 


728 


V 


A 


H 


A 


R 


p 


Q 


1 


s 


A 


729 


V 


A 


H 


A 


R 


p 


R 


V 


p 


S 


730 


V 


A 


H 


A 


R 


p 


R 


V 


p 


A 


731 


V 


A 


H 


A 


R 


p 


R 


V 


s 


S 


732 


V 


A 


H 


A 


R 


. p 


R 


V 


s 


A 


733 


V 


A 


H 


A 


R 


p 


R 


1 


p 


S 


734 


V 


A 


H 


A 


R 


p 


R 


1 


p 


A 


735 


V 


A 


H 


A 


R 


p 


R 


1 


s 


S 


736 


V 


A 


H 


A 


R 


p 


R 


1 


s 


A 


737 


V 


T 


R 


E 


C 


s 


Q 


V 


p 


S 


738 


V 


T 


R 


E 


C 


s 


Q 


V 


p 


A 


739 


V 


T 


R 


£ 


C 


s 


Q 


V 


s 


S 


740 


V 


T 


R 


E 


C 


s 


Q 


V 


s 


A 


741 


V 


T 


R 


E 


C 


s 


Q 


1 


p 


S 


742 


V 


T 


R 


E 


C 


s 


Q 


1 


p 


A 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 
Number 


Amino Acid Position and Identities 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


743 . 


V 


T 


R 


E 


C 


S 


Q 


1 


S 


S 


744 


V 


T 


R 


E 


C 


S 


Q 


1 


S 


A 


745 


V 


T 


R 


E 


c 


S 


R 


V 


P 


S 


746 


V 


T 


R 


. E 


c 


S 


R 


V 


P 


A 


747 


V 


T 


R 


E 


c 


S 


R 


V 


s 


S 


748 


V 


T 


R 


E 


c 


S 


R 


V 


s 


A 


749 


V 


T 


R 


E 


c 


S 


R 


1 


p 


S 


750 


V 


T 


R 


E 


c 


s 


R 


1 


p 


A 


751 


V 


■ T 


R 


E 


c 


s 


R 


1 


s 


S 


752 


V 


T 


R 


E 


c 


s 


R 


1 


s 


A 


753 


V 


T 


R 


E 


c 


p 


Q 


V 


p 


S 


754 


. V 


T 


R 


E 


c 


p 


Q 


V 


p 


A 


755 


V 


T 


R 


E 


c 


p 


Q 


V 


s 


S 


756 


V 


T 


R 


E 


c 


p 


Q 


V 


s 


A 


757 


V 


T 


R 


E 


c 


p 


Q 


1 


p 


S 


758 


V 


T 


R 


E - 


c 


p 


Q 


1 


p 


A 


759 


V 


T 


R 


. E 


c 


p 


Q 


1 


s 


S 


760 


V 


T 


R 


E 


c 


p 


Q 


1 


s 


A 


761 


V 


T 


R 


E 


c 


p 


R 


V 


p 


S 


762 


V 


T 


R 


E 


c 


p 


R 


V 


p 


A 


763 


V 


T 


R 


E 


c 


p 


R 


V 


s 


S 


764 


V 


T 


R 


E 


c 


p 


R 


V 


s 


A 


765 


V 


T 


R 


E 


c 


p 


R 


^ 


p 


S 


766 


V 


T 


R 


E 


c 


p 


R 


1 


p 


A 


767 


V 


T 


R 


E 


c 


p 


R 


1 


s 


S 


768 


V 


T 


R 


E 


c 


p 


R 


1 


s 


A 


769 


V 


T 


R 


E 


R 


s 


Q 


V 


p 


S 


770 


V 


T 


R 


E 


R 


s 


Q 


V 


R 


A 


771 


V 


T 


R 


E 


R 


s 


Q 


V 


s 


S 


772 


V 


- T 


R 


E 


R 


s 


Q 


V 


s 


A 


773 


V 


T 


R 


E 


R 


s 


Q 


1 


p 


S 


774 


V 


T 


R 


E 


R 


s 


Q 


1 


p 


A 


775 


V 


T 


R 


E 


R 


s 


Q 


1 


s 


S 


776 


V 


T 


R 


E 


R 


s 


Q 


1 


s 


A 


777 


V 


T 


R 


E 


R 


s 


R 


V 


p 


S 


778 


V 


T 


R 


E 


R 


s 


R 


V 


p 


A 


779 


V 


T 


R 


E 


R 


s 


R 


V 


s 


S 


780 


V 


T 


R 


E 


R 


s 


R 


V 


s 


A 


781 


V 


T 


R 


E ' 


R 


s 


R 


1 


p 


S 


782 


V 


T 


R 


E 


R 


s 


R 


1 


p 


A 


783 


V 


T 


R 


E 


R 


s 


R 


1 


s 


S 


784 


V 


T 


R 


E 


R 


s 


R 


1 


s 


A 


785 


V 


T 


R 


E 


R 


p 


Q 


V 


p 


S 


786 


V 


T 


R 


E 


R 


p 


Q 


V 


p 


A 


787 


V 


T 


R 


E 


R 


p 


Q 


V 


s 


S 


788 


V 


T 


R 


E 


R 


p 


Q 


V 


s 


A 


789 


V 


T 


R 


E 


R 


p 


Q 


1 


p 


S 


790 


V 


T 


R 


E 


R 


p 


Q 


1 


p 


A 


791 


V 


T 


R 


E 


R 


p 


Q 


1 


s 


S 


792 


V 


T 


R 


E 


R 


p 


Q 


1 


s 


A 


793 


V 


T 


R 


E 


R 


p 


R 


V 


p 


S 


794 


V 


T 


R 


E 


R 


p 


R 


V 


p 


A 


795 


V 


T 


R 


E 


R 


p 


R 


V 


s 


S 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


676 


752 


796 


V 


T 


R 


E 


R 


P 


R 


V 


S 


A 


797 


V 


T 


R 


E 


R 


P 


R 


1 


P 


S 


798 


V 


T 


R 


E 


R 


P 


R 


1 


P 


A 


799 


V 


T 


R 


E 


R 


P 


R 


1 


S 


S 


800 


V 


T 


R 


E 


R 


P 


R 


1 


s 


A 


801 


V 


T 


R 


A 


C 


S 


Q 


V 


p 


S 


802 


V 


T 


R 


A 


c 


S 


Q 


V 


p 


A 


803 


V 


T 


R 


A 


c 


S 


Q 


V 


s 


S 


804 


V 


T 


R 


A 


c 


S 


Q 


V 


s 


A 


805 


V 


T 


R 


A 


c 


S 


Q 


1 


p 


S 


806 


V 


T 


R 


A 


c 


S 


Q 


1 


P- 


A 


807 


V 


T 


R 


A 


c 


s 


Q 


1 


s 


S 


808 


V 


T 


R 


A 


c 


s 


Q 


1 


s 


A 


809 


V 


T 


R 


A 


c 


s 


R 


V 


p 


S 


810 


V 


T 


R 


A 


c 


s 


R 


V 


p 


A 


811 


V 


T 


R 


A 


c 


s 


R 


V 


s 


S 


812 


V 


T 


R 


A 


c 


s 


R 


V 


s 


A 


813 


V 


T 


R 


A 


c 


s 


R 


1 


p 


S 


814 


V 


T 


R 


A 


c 


s 


R 


1 


p 


A 


815 


V 


T 


R 


A 


c 


s 


R 


1 


s 


S 


816 


V 


T 


R 


A 


c 


s 


R 


1 


s 


A 


817 


V 


T 


R 


A 


c 


p 


Q 


V 


p 


S 


818 


V 


T 


R 


A 


c 


p 


Q 


V 


p 


A 


819 


V 


T 


R 


A 


c 


p 


Q 


V 


s 


S 


820 


V 


T 


R 


A 


c 


p 


Q 


V 


s 


A 


821 


V 


T 


R 


A 


c 


p 


Q 


1 


p 


S 


822 


V 


T 


R 


A 


c 


p 


Q 


1 


p 


A 


823 


V 


T 


R 


A 


c 


p 


Q 


1 


s 


S 


824 


V 


T 


R 


A 


c 


p 


Q 


I 


s 


A 


825 


V 


T 


. R 


A 


c 


p 


R 


V 


p 


S 


826 


V 


T 


R 


A 


c 


p 


R 


V 


p 


A 


827 


V 


T 


R 


A 


c 


p 


R 


V 


s 


S 


828 


V 


T 


R 


A 


c 


p 


R 


V 


s 


A 


829 


V 


T 


R 


A 


c 


p 


R 


1 


p 


S 


830 


V 


T 


R 


A 


c 


p 


R 


1 


p 


A 


831 . 


V 


T 


R 


A 


c 


p 


R 


1 


s 


S 


832 


V 


T 


R 


A 


c 


p 


R 


1 


s 


A 


833 


V 


T 


R 


A 


-R 


s 


Q 


V 


p 


S 


834 


V 


T 


R 


A 


R 


s 


Q 


V 


p 


A 


835 


V 


T 


R 


A 


R 


s 


Q 


V 


s 


S 


836 


V 


T 


R 


A 


R 


s 


Q 


V 


s 


A 


837 


V 


T 


R 


A 


R 


s 


Q 


1 


p 


S 


838 


V 


T 


R 


A 


R 


s 


Q 




p 


A 


839 


V 


T 


R 


A 


R 


s 


Q 


1 


s 


S 


.840 


V 


T 


R 


A 


R 


s 


Q • 


1 


s 


A 


841 


V 


T 


R 


A 


R 


s 


R 


V 


p 


S 


842 


V 


T 


R 


A 


R 


s 


R 


V 


p 


A 


843 


V 


T 


R 


A 


R 


s 


R 


V 


s 


S 


844 


V 


T 


R 


A 


R 


s 


R 


V 


s 


A 


845 


V. 


T 


R 


A 


R 


s 


R 


. 1 


p 


S 


846 


V 


T 


R 


A 


R 


s 


R 


1 


p 


A 


847 


V 


T 


R 


A 


R 


s 


R 


1 


s 


S 


848 


V 


T 


R 


A 


R 


s 


R 


1 


s 


A 
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f ableZ Novel Polymorphic Variant of IL4Ra (cx>nycl) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 


849 


V 


T 


R 


A * 


R 


P 


Q 


V 


P 


S 


850 


V 


T 


R 


A 


R 


P 


Q 


V 


P 


A 


851 


V 


T 


R 


A 


R 


P 


Q 


V 


s 


S 


852 


V 


T 


R 


A 


R 


P 


Q - 


V 


s 


A 


853 


V 


T 


R 


A 


R 


P 


Q 


1 


p 


S 


854 


V 


T 


R 


A 


R 


P 


Q 


1 


p 


A 


855 


V 


T 


R 


A 


R 


P 


Q 


1 


s 


S 


856 


V 


T 


R 


A 


R 


P 


Q 


1 


s 


^ A 


857 


V 


T 


R 


A 


R 


P 


R 


V 


p 


S 


858 


V 


T 


R 


A 


R 


P 


R 


V 


p 


A 


859 


V 


T 


R 


A 


R 


P 


R 


V 


s 


S 


860 


V 


T 


R 


A 


R 


P 


R 


V 


s 


A 


861 


V 


T 


R 


A 


R 


P 


R 


1 


p 


S 


862 


V 


T 


R 


A 


R 


P 


. R 


1 


p 


A 


863 


V 


T 


R 


A 


R 


P 


R 


1 


s 


S 


864 


V 


T 


R 


A 


R 


P 


R 


1 


s 


A 


865 


V 


T 


H 


. E 


C 


s 


Q 


V 


p 


S 


866 


V 


T 


H 


E 


C 


s 


Q 


V 


p 


A 


867 


V 


T 


H 


E 


C 


s 


Q 


V 


s 


S 


868 


V 


T 


H 


. E 


C 


s 


Q 


V 


s 


A 


869 


V 


T 


H 


E 


C 


s 


Q 


i 


p 


S 


870 


V . 


T 


H 


E 


c 


s 


Q 


1 


p 


A 


871 


V 


T 


H 


E 


c 


s 


Q 


1 


s 


S 


872 


V 


T 


H 


E 


c 


s 


Q 


1 


s 


A 


873 


V . 


T 


H 


E 


c 


s 


R 


V 


p 


S 


874 


V 


T 


H 


E 


c 


s 


R 


V 


p 


A 


875 


V 


T 


H 


E 


c 


s 


R 


V 


s 


S 


876 


V 


T 


H 


E 


c 


s 


R 


V 


s 


A 


877 


V 


T 


H 


E 


c 


s 


R 


1 


p 


S 


878 


V 


T 


H 


E 


c 


s 


R 


1 


p 


A 


879 


V 


T 


H 


E 


c 


s 


R 


1 


s 


S 


880 


V 


T . 


H 


E 


c 


s 


R 


1 


s 


A 


881 


V 


T 


H 


E 


c 


P 


Q 


V 


p 


S 


882 


V 


T 


H 


E 


c 


p 


Q 


V 


p 


A 


883 


V 


T 


H 


E 


c 


p 


Q 


V 


s 


S 


884 


V 


T 


H 


E 


c 


p 


Q 


V 


s 


A 


885 


V 


T 


H 


E 


c 


p 


Q 


1 


p 


S 


886 


V 


T 


H 


. E 


c 


p 


Q 


1 


p 


A 


887 


V 


T 


H 


E 


c 


p 


Q 


1 


s 


S 


888 


V 


T 


H 


E 


c 


p 


Q 


1 


s 


A 


889 


V 


. T 


H 


E 


c 


p 


R 


V 


p 


S 


890 


V 


T 


H 


E 


c 


p 


R 


V 


p 


A 


891 


V 


T 


H 


E 


c 


p 


R 


V 


s 


S 


892 


V 


T 


H 


E 


c 


p 


R 


V 


s 


A 


893 


V 


T 


H 


E 


c 


p 


R 


1 


p 


S 


894 


V 


T 


H 


E 


c 


p 


R 




p 


A 


895 


V 


T 


H 


E 


c 


p 


R- 


1 


s 


S 


896 


V 


T 


H 


E 


c 


p 


R 


1 


s 


A 


897 


V 


T 


H 


E 


R 


s 


Q 


V 


p 


S 


898 


V 


T 


H 


E 


R 


s 


Q 


V 


p 


A 


899 


V 


T 


H 


E 


R 


s 


Q 


V 


s 


S 


900 


V 


T 


H 


E 


R 


s 


Q 


V 


s 


A 


901 


V 


T 


H 


E 


R 


s 


Q 


1 


p 


S 
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TabI 2. Novel Polymorphic Variant of IL4Ra (conf d) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


762 


902 


V 


T 


H 


E 


R 


s 


Q 


1 


P 


A 


903 


V 


T 


H 


E • 


R 


s 


Q 


1 


S 


S 


904 


V 


T 


H 


E 


R 


s 


Q 


1 


s 


A 


905 


V 


T 


H 


E 


R 


s 


R 


V 


p 


S 


906 


V 


T 


H 


E 


R 


s 


R 


V 


p 


A 


907 


V 


T 


H 


E 


R 


s 


R 


V 


s 


S 


908 


V 


T 


H 


E 


R 


s 


R 


V 


s 


A 


909 


V 


T 


H 


E 


R 


s 


R 


1 


p 


S 


910 


V 


T 


H 


E 


R 


s 


R 


1 


p 


A 


911 


V 


T 


H 


E 


R 


s 


R 


1 


s 


S 


912 


V 


T 


H 


E 


R 


s 


R 


i 


s 


A 


913 


V 


T 


H 


• E 


R 


p 


Q 


V 


p 


S 


914 


V 


T 


H 


E 


R 


p 


Q 


V 


p 


A 


916 


V 


' T 


H 


E 


R 


p 


Q 


V 


s 


S 


916 


V 


T 


H 


E 


R 


p 


Q 


V 


s 


A 


917 


V 


T 


H 


E 


R 


p 


Q 


1 


p 


S 


918 


V 


T 


H 


E 


R 


p 


Q 


1 


p 


A 


919 


V 


T 


H 


E 


R 


p 


Q 


1 


s 


S 


920 


V 


T 


H 


E 


R 


p 


Q 


. 1 


s 


A 


921 


V 


T 


H 


E 


R 


p 


R 


V 


p 


S 


922 


V 


T 


H 


E 


R 


p 


R 


V 


p 


A 


923 


V 


T 


H 


E . 


R 


p 


R 


V 


s 


S 


924 


V 


T 


H 


E 


R 


p 


R 


V 


s 


A 


925 


V 


T 


H 


E 


R 


p 


R 


1 


p 


S 


926 


V 


T 


H 


E 


R 


p 


R 


1 


p 


A 


927 


V 


T 


H 


E 


R 


p 


R 


1 


s 


S 


928 


V 


T 


H 


E 


R 


p 


R 


1 


s 


A 


929 


V 


T 


H 


A 


C 


s 


Q 


V 


p 


S 


930 


V 


T 


H 


A 


C 


s 


Q 


V 


p 


A 


931 


V 


T 


H 


A 


C 


s 


Q 


V 


s 


S 


932 


V 


T 


H 


A 


C 


s 


Q 


V 


s 


A 


933 


V 


T 


H 


A 


c 


s 


Q 


1 


p 


S 


934 


V 


T 


H 


A 


c 


s 


Q 


1 


p 


A 


935 


V 


T 


H 


A 


c 


s 


Q 


1 


s 


S 


936 


V 


T 


H 


A 


c 


s 


Q 


1 


s 


A 


937 


V 


T 


H 


A 


c 


s 


R 


V 


p 


S 


938 


V 


T 


H 


A 


c 


s 


R 


V 


p 


A 


939 


V 


T 


H 


A 


c 


s 


R 


V 


s 


S 


940 


V 


T 


H 


A 


c 


s 


R 


V 


s 


A 


941 


V 


T 


H 


A 


c 


s 


R 


1 


p 


S 


942 


V 


T 


H 


A 


c 


s 


R 


1 


p 


A 


043 


V 


T 


H 


A 


c 


s 


R 


1 


s 


S 


944 


V 


T 


H 


A 


c 


s 


R 


1 


s 


A 


945 


V 


T 


H 


A 


c 


p 


Q 


V 


p 


S 


946 


V 


T 


H 


A 


c 


p 


Q 


V 


p 


A 


947 


V 


T 


H 


A 


c 


p 


Q 


V 


s 


S 


948 


V 


T 


H 


A 


c 


p 


Q 


V 


s 


A 


949 


V 


T . 


H 


A 


c 


p 


Q 


1 


p 


S 


950 


V 


T 


H 


A 


c 


p 


Q 


. 1 


p 


A 


951 


V 


T 


H 


A 


c 


p 


Q 


1 


s 


S 


952 


V 


T 


H 


A 


c 


p 


Q 


1 


s 


A 


953 


V 


T 


H 


A 


c 


p 


R 


V 


p 


S 


954 


V 


T 


H 


A 


c 


p 


R 


V 


p 


A 
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Table2. Novel Polymorphic Variant of IL4Ra (cont'd) 



Polymorphic 
Variant 



Amino Acid Position and Identities 



Number 


75 


82 


185 


400 


431 


503 


576 


579 


675 


752 
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10 



The invention also includes IL4Ra peptide variants, which are any fragments of an IL4Ra 
protein variant that contains one or more of the amino acid variations shown in Table 2. An DL4Ra 
peptide variant is at least 6 amino acids in length and is preferably any number between 6 and 30 amino 
acids long, more preferably between 10 and 25, and most preferably between 15 and 20 amino acids long. 
Such IL4Ra peptide variants may be useful as antigens to generate antibodies specific for one of the 
above IL4Ra isofonns. In addition, the IL4Ra peptide variants may be useful in drug screening assays. 

An IL4Ra variant protein or peptide of the invention may be prepared by chemical synthesis or 

by e^qiressing one of the variant IL4Ra genomic and cDNA sequences as described above. Alternatively, 

the IL4Ra protein variant may be isolated from a biological sample of an individual bavins an IL4Ra 

35 
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isogene which encodes the variant protein. Where the sample contains two different IL4Rct isoforms 
(i.e., the individual has different IL4Ra isogenes), a particular IL4Ra isoform of the invention can be 
- isolated by immunoaffinity chromatography using an antibody which specifically binds to that particular 
' IL4Ra isoform but does not bind to the other IL4Ra isoform. 
5 The expressed or isolated IL4Ra protein may be detected by methods known in the art, including 

Coomassie blue staining, silver staining, and Western blot analysis using antibodies specific for the 
isoform of the. IL4Ra protein as discussed further below. IL4Ra variant proteins can be purified by 
standard protein pimfication procedures known in the art, including differential precipitation, molecular 
sieve chromatography, ion-exchange chromatography, isoelectric focusing, gel electrophoresis, affinity 

10 and iimnunoafiinity chromatography and the like. (Ausubel et. al., 1 987, In Current Protocols in 
Molecular Biology John Wiley and Sons, New York, New York). In the case of immunoaffinity 
chromatogmphy, antibodies specific for a particular polymorphic variant may be used. . 

A polymorphic variant IL4Ra gene of the invention may also be fused in fi-ame with a 
heterologous sequence to encode a chimeric IL4Ra protein. The non-IL4Ra portion of the chimeric 

1 5 protein may be recognized by a commercially available antibody. In addition, the chuneric protein may 
also be engineered to contain a cleavage site located between the IL4Ra and non-IL4Ra portions so that 
the IL4Ra protdn may be cleaved and purified away fit>m the non-IL4Ra portion. 

An additional ^bodiment of the invention relates to using a novel IL4Ra protein isoform in any 
of a variety of drug screening assays. Such screening assays may be performed to identify agents that 

20 bind specifically to all known IL4Ra protein isoforms or to only a subset of one or more of these 
isoforms. The agents may be from chemical compound libraries, peptide libraries and the like. The 
n^Ra protein or pq)tide variant nfiay be free in solution or affixed to a solid support. In one 
embodiment, high throughput screening of compoimds for binding to an IL4Ra variant may be 
accomplished using the method described in PGT application WO84/03565, in which large numbers of 
. 25 test compounds are synthesized on a solid substrate, such as plastic pins or some other surface, contacted 
with the IL4Ra protein(s) of interest and then washed. Bound IL4Ra protein(s) are then detected using 
methods well-known in the art. ' 

In another embodiment, a novel IL4Ra protein isoform may be used in assays to measure the 
binding affinities of one or more candidate drugs targeting the IL4Ra protein. 

30 In another embodiment, the invention provides antibodies specific for and immunoreactive with 

one or more of the novel IL4Ra variant proteins described herein. - The antibodies may be either 
monoclonal or polyclonal in origin. The IL4Ra protein or peptide variant used to generate the antibodies 
may be fi:om natuml or recombinant sources or produced by chemical synthesis using synthesis 
techniques known in the art. If the IL4Ra protein variant is of insufficient size to be antigenic, it may be 

35 conjugated, complexed, or otherwise covalently linked to a carrier molecule to enhance the antigenicity of 

the peptide. Examples of carrier molecules, include, but are not limited to, albumins (e.p.. human. 
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bovine, fish, ovine), and keyhole limpet hemocyanin (Basic and Clinical Immunology, 1991, Eds, D.P, 
Stites, and A.I. Terr, Appleton and Lange, Norwalk Connecticut, San Mateo, California). 

In one embodiment, an antibody specifically immunorcactive with one of the novel IL4Ra 
protein isoforms described herein is administered to an individual to neutralize activity of the IL4Ra 
5 isofonn expressed by that individual. The antibody may be. formulated as'a pharmaceutical composition , 
which includes a pharmaceutically acceptable carri^. 

Antibodies specific for and immunorcactive with one of the novel IL4Ra protein isoform 
described herein may be used to inmiunoprecipitate the IL4Ra protein variant from solution as well as 
react with IL4Ra protein isoforms on Westem or immimoblots of poiyacrylamide gels on membrane 

10 supports or substrates. In another preferred embodiment, the antibodies will detect IL4Ra protein 

isoforms in pmaffin or frozen tissue sections, or in cells which have been fixed or unfixed and prepared 
on slides, coverslips, or the like, for use in inmiunocytochemical, immunohistochemical, and 
immunofluorescence techniques. 

In another embodiment, an antibody specifically immunoreactive with one of the novel IL4Ra 

15 protein variants described herein is used in immuxioassays to detect this variant in biological samples. In 
this method, an antibody of the present invention is contacted with a biological sample and the formation 
of a complex between the IL4Ra protein variant and the antibody is detected. As described, suitable 
iromunoassays include radioimmunoassay, Westem blot assay, immunofluorescent assay, enzyme linked 
immunoassay (ELISA), chemiluminescent assay, inununohistochemical assay, immunocytochemical 

20 assay, and the like (see, e.g.. Principles and Practice of Immunoassay, 1991 , Eds. Christopher P. Price and 
David J. Nepman, Stockton Press, New York, New York; Current Protocols in Molecular Biology, 1 987, 
Eds. Ausubel et aL, John Wiley and Sons, New York, New York). Standard techniques known in the art 
for ELISA are described in Methods in Immunodiagnosis, 2nd Ed., Eds. Rose and Bigazzi, John Wiley 
and Sons, New York 1980; and Campbell et al., 1984, Methods in Immunology, W.A. Benjaniin, Inc.). 

25 Such assays may be direct, indirect, competitive, or noncompetitive as described in the art (see, e.g.. 

Principles and Practice of Immunoassay, 1991, Eds. Christopher P. Price and David J. Neoman, Stockton 
Pres, NY, NY; and Oelhrich, M., 1984, J. Clin. Chran. Clin. Biochem., 22:895-904). Proteins may be 
isolated from test specimens and biological samples by conventional methods, as described in Cuirent 
Protocols in Molecular Biology, supra. 

30 Exenqplary antibody molecules for use in the detection and therapy methods of the present 

invention are intact immunoglobulin molecules, substantially intact immunoglobulin miolecules, or those 
portions of immunoglobulin molecules that contain the antigen binding site. Polyclonal or. monoclonal 
antibodies may be produced by methods conventionally known in the art (e.g., Kohler and Milstein, 
1975, Nature, 256:495-497; Campbell Monoclonal Antibody Technology, the Production and 

35 Characterization of Rodent and Human Hybridomas, 1985, In: Laboratory Techniques in Biochemistry 
and Molecular Biology, Eds, Burdon et al.. Volume 13, Elsevier Science Publishers, Amsterdam). The 
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antibodies or antigen binding fragments thereof may also be produced by genetic engineenng. The 
technology for e^qiiession of both heavy and Ught chain genes in £. coli is die subject of PCT patent 
applications, publication number WO 901443, WO 901443 and WO 9014424 and in Huse et al., 1989, 
Science, 246:1275-1281, The antibodies may also be humanized (e.g.. Queen, C. et al. 1989 Proc. 
5 Nad. Acad. Sci. 86; 10029). 

£frect(s) of the polymorphisms identified herein on expression of IL4Ra may be investigated by 
preparing recombinant cells and/or organisms, preferably recombinant animals, containing a polymorphic 
variant of the IL4Ra gene. As used herein, **expression" includes but is not limited to one or more of the 
following: transcription of the gene into precursor mKNA; splicing and other processing of the precursor 

10 mRNA to produce mature mRNA; mRNA stability; translation of the mature mRNA into IL4Ra protein 
(including codon usage and tRNA availability); and glycosylation and/or other modifications of the 
translation product, if required for proper egression and function. 

To prepare a recombinant cell of the invention, the desired IL4Ra isogene may be introduced into 
the cell in a vector such that the isogene remains extrachromosomaL In such a situation, the gene will be 

IS esqyressed by the cell from the extrachromosomal location. In a preferred embodiment, the IL4Ra 
isog^e is introduced into a cell in such a way that it recombines with the endogenous IL4Ra gene 
present in the ceU. Such recombination requires the occurrence of a double recombination event, thereby 
resulting in the desired IL4Ra gene polymorphism. Vectors for the introduction of genes both for 
recombination and for extrachromosomal maintenance are known in the art, and any suitable vector or 

20 vector construct may be used in the invention. Methods such as electroporation, particle bombardment, 
calcium phosphate co-precipitation and viral transduction for introducing DNA into cells are known in , 
the art; therefore, the choice of method may lie with the competence and preference of the skOled 
practitioner. Examples of cells into which the IL4Ra isogene may be introduced include, but are not 
limited to, continuoxis culture cells, such as COS, NIH/3T3, and primary or culture cells of the relevant 

25 tissue type, i.e., they express the IL4Ra isogene. Such recombinant cells can be used to compare the 
biological activities of the different protein variants. 

Recombinant organisms, i.e., transgenic animals, expressing a variant IL4Ra gene are prepared 
using standard procedures known in the art. Preferably, a construct comprising the variant gene is 
introduced into a nonhuman animal or an ancestor of the animal at an ^nbryonic stage, i.e., the one-cell 

30 ' stage, or generally not later than about the eight-cell stage. Transgenic animals carrying the constructs of 
the invention can be made by several methods known to those having skill in the art. One method 
involves tiansfecting into the embryo a retrovirus constructed to contain one or more insulator elements, a 
gene or genes of interest, and other components known to those skilled in the art to provide a complete 
shuttle vector harboring the msulated gene(s) as a transgene, see e.g., U.S. Patent No. 5,610,053. Another 

. 35 method involves directly injecting a transgene into the embryo. A third method involves the use of 

embryonic stem cells. Examples of animals into which the IL4Ra isogenes may be introduced include, 
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but are not limited to, mice, rats, other rodents, and nonhuman primates (see "The Introduction of Foreign 
Genes into Mice" and the cited references therein, In: Recombinant DNA, Eds. J.D. Watson, Gibnan, 
J. Witkowski, and M. ZoUer; :W.H. Freeman and Company, New York, pages 254-272). Transgenic 
animals stably expressing a human IL4Ra isogene and producing human IL4Ra protein can be used as 
5 biological models for studying diseases related to abnormal IL4Ra expression and/or activity, and for 
screening and assaying various candidate drugs, compounds, and treatment regimens to reduce the 
symptoms or effects of these diseases. 

An additional embodiment of the invention relates to pharmaceutical compositions for treating 
disorders affected by expression or function of a novel IL4Ra isogene described herein. The 

10 pharmaceutical composition may comprise any of the following active ingredients: a polynucleotide 

comprising one of these novel IL4Ra isogdies; an antiseiise oligonucleotide directed against one of the 
novel IL4Ra isogenes, a polynucleotide encoding such an antisense oligonucleotide, or another 
compound which inhibits expression of a novel IL4Ra isogene described herein. Preferably, the 
composition contains the active ingredient in a ther^>eutically effective amount. By therapeutically 

1 5 effective amount is meant that one or more of the symptoms relating to disorders affected by expression 
or fimction of a novel IL4Ra isogene is reduced and/or eliminated. The composition also comprises a 
pharmaceutically acceptable carrier, examples of which include, but are not limited to, saline, buffered 
saHne, dextrose, and water. Those skilled in the ait may employ a formulation most suitable for the active 
ingredient, whether it is a polynucleotide, oligonucleotide, protein, peptide or small molecule antagonist. 

20 The pharmaceutical composition may be administered alone or in combination with at least one other 

agent, such as a stabihzing compound. Administration of the pharmaceutical composition may be by any 
number of routes including, but not limited to omi, intravenous, intramuscular, iutra-arterial, 
intramedullary, intrathecal, intraventricular, intradermal, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual, or rectal. Further details on techniques for formulation and 
. 25 administration may be found in the latest edition'of Remington's Pharmaceutical Sciences (Maack 
Publishing Co., Eastern, PA). 

For any composition, determination of the therapeutically effc^ctive dose of active ingredient 
and/or the appropriate route of administration is well within the c^bility of those skilled in the art For 
example, the dose can be estimated initially either in cell culture assays or in animal models. The animal 

30 model may also be used to determine the appropriate concentration range and route of administration. 
Such information can then be used to determine useful doses and routes for administration in humans. 
The exact dosage will be determine by the practitioner, in light of factors relating to the patient requiring 
treatment, including but not limited to severity of the disease state, geneml health, age, weight and gender 
of the patient, diet, time and frequency of administmtion, other drugs being taken by the patient, and 

35 tolerance/response to the treatment. 

Information on the identity of genotypes and haplotypes for the IL4Ra gene of any particular 
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individual as well as the frequency of such genotypes and haplotypes in any particular population of 
individuals is expected to be usefiil for a variety of basic research znd clinical applications. Thus, the 
invention also provides compositions and methods for detecting the novel IL4Ra polymorphisms 
identified herein. 

5 The compositions comprise at least one IL4Ra genotyping oligonucleotide* In one embodiment, 

an IL4Ra genotyping oligonucleotide is a probe or primer cqiable of hybridizing to a target region that is 
located close to, or that contains, one of the novel polymorphic sites described herein. As used herein, the 
term "oligonucleotide" refers to a polynucleotide molecule having less than about 100 nucleotides* A 
preferred oligonucleotide of the invention is 10 to 35 nucleotides long. More preferably, the 

10 oligonucleotide is between 15 and 30, and most preferably, between 20 and 25 nucleotides in length. The 
oligonucleotide may be comprised of any phosphorylation state of ribonucleotides^ deoxyribonucleotides, 
and acyclic nucleotide derivatives, and other functionally equivalent derivatives. Alternatively, 
oligonucleotides may have a phosphate-free backbone, which may be comprised of linkages such as . 
carboxymethyl, acetamidate, cari>amate, polyamide ^^tide nucleic acid (PNA)) and the like (Varma, R. 

15 in Molecular Biology and Biotechnology, A Comprehensive Desk Reference, £d. R. Meyers, VCH 

Publishers, Inc. (1995), pages 617-620). Oligonucleotides of the invention may be prepared by chemical 
synthesis using any suitable methodology known in the art, or may be derived from a biological sample, 
for example, by restriction digestion. The oligonucleotides may be labeled, according to any technique 
known in the art, including use of radiolabels, fluorescent labels, enzymatic labels, proteins, haptens, . 

20 antibodies, sequence tags and the like. 

Genotyping oligonucleotides of the invention must be capable of specifically hybridizing to a 
target region of an IL4Ra polynucleotide, i.e., an IL4Ra isogene. As used herein, specific hybridization 
means the oligonucleotide forms an anti-parallel double-stranded structure with llie target region under 
certain hybridizing conditions, while failing to form such a structure when incubated with a non-target 

25 region or a non-IL4Ra polynucleotide under the same hybridizing conditions. Prefaably, the 

oligonucleotide specifically hybridizes to the target region under conventional high stringency conditions. 
The skiUed artisan can readily design, and test oligonucleotide probes and primers suitable for detecting 
polymorphisms in the IL4Ra gene using the polymorphism information provided herein in conjunction 
with the known sequence information for the IL4Ra gene and routine techniques. 

30 A nucleic acid molecule such as an oligonucleotide or polynucleotide is said to be a '*perfect" or 

. "complete" complement of another nucleic acid molecule if every nucleotide of one of the molecules is 
complementary to the nucleotide at the corresponding position of the other molecule. A nucleic acid 
molecule is "substantially complementar)^' to another molecule if it hybridizes to that molecule with 
sufficient stability to remain in a duplex form under conventional low-stringency conditions. 

35 Conventional hybridization conditions are described, for example, by Sambrook J. et al., in Molecular 
Cloning, A Laboratory Manual, 2"^ Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989) 
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and by Haymes, B.D. et al. in Nucleic Acid Hybridization, A Practical Appr ach, IRL Press, Wasbington, 
b.C. (1985). While perfectly complementaiy oligonucleotides are preferred for detecting 
polymorphisms^ departures from complete complementarity are contemplated where such departures do 
not prevent the molecule from specifically hybridizing to the target region. For example, an 
5 oligonucleotide primer may have a non-complementary fragment at its 5 ' end, with the remainder of the 
primer being complementary to the target region. Alternatively, non-complementary nucleotides may be 
interspersed into the oligonucleotide probe or primer as long as the resulting probe or primer is still 
capable of specifically hybridizing to the target region. 

Preferred genotyping oligonucleotides of the invention are allele-specific oligonucleotides. As 

10 used herein, the term allele-specific oligonucleotide (ASO) means an oligonucleotide that is able, under 
sufficiently stringent conditions, to hybridize specifically to one allele of a gene, or other locus, at a target 
region containing a polymorphic site while not hybridizing to the corresponding region in another 
allele(s). As understood by the skilled artisan, allele-specificity will dqpend upon a variety of readily 
^ optimized stringency conditions, including ssalt and formamide concentrations, as well as t^nperatures for 

15 ' both the hybridization and washing steps. Examples of hybridization and washing conditions typically 
used for ASO probes are found in 'Kogan et al., "Genetic Prediction of Hemophilia A'* in PGR Protocols, 
A Guide to Methods and Applications, Academic Press, 1990 and Ruano et ah, 87 Proc. NatL Acad. Sdi. 
USA 6296-6300, 1990. Typically, an allele-specific oligonucleotide will be perfectiy complementary to 
one allele while containing a single mismatch for another allele. 

20 Allele-specific oligonucleotide probes which usually provide good discrimination between 

different alleles are those in which a central position of the oligonucleotide probe aligns with the 
polymoiphic site m tiie target region (e.g., approximately the 7*" or fi^ position in a 15 mer, the S"* or 9'* 
position in a 1 6mer, the 10^ or 1 1 position in a 20 mer). A preferred ASO probe for detecting IL4Ra 
gene polymorphisms comprises a nucleotide sequence, listed 5' to 3', selected fiiom the group consisting 

25 of : 
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TTGCACCACTGCACT 


(SEQ 


ID 


NO: 


4) 


and its complement. 


TTGCACCGCTGCACT 


.(SEQ 


ID 


NO: 


5) 


and its complement. 


TTTTGTGCTATTCCC 


(SEQ 


ID 


NO: 


6) 


and its complement, 


TTTTGTGTTATTCCC * 


(SEQ 


ID 


NO: 


7) 


and its complement. 


CTGGGCCGCTCAGGC 


(SEQ 


ID 


NO: 


8) 


and its complement/ 


CTGGGCCTCTCAGGC 


. (SEQ 


ID 


NO: 


9) 


and its complement. 


TAAGCCTGCGCTGGA 


(SEQ 


ID 


NO: 


10) 


and its complement. 


TAAGCCTCCGCTGGA 


(SEQ 


ID 


NO: 


11) 


and its complement. 


AGAACAACGGAGGCG 


(SEQ 


ID 


NO: 


12) 


and its complement. 


AGAACAATGGAGGCG 


(SEQ 


ID 


NO: 


13) 


and its complement. 


CGGAGGCGCGGGGTG 


(SEQ 


ID 


NO: 


14) 


and its complement. 


CGGAGGCACGGGGTG 


(SEQ 


ID 


NO: 


15) 


and its complement. 
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GTGCGGATAACTATA 
GTGCGGACAACTATA 


(SEQ 
(SEQ 


ID 
ID 


N0:16) 
N0:17) 


and 
and 


its 
its 


complement, 
complement. 


5 


CGGAGTGCGGCAGGG 
CGGAGTGTGGCAGGG 


(SEQ 
(SEQ 


ID 
ID 


N0:18) 
NO:19) 


and 
and 


its 
its 


complement, 
complement. 




GCCTGGGCTGAGGGT 
GCCTGGGTTGAGGGT 


(SEQ 
(SEQ 


ID 
ID 


NO:20) 
NO:21) 


and 
and 


its 
its 


complement, 
complement/ 


10 


TGGGGTGGGCAGGGG 
TGGGGTGAGCAGGGG 


(SEQ 
(SEQ 


ID 
ID 


N0:22) 
NO: 23) 


and 
and 


its 
its 


complement , 
complement. 


15 


TTCTCCCGCAGTGAA 
TTCTCCCACAGTGAA 


(SEQ 
(SEQ 


ID 
ID 


NO:24) 
NO: 25) 


and 
and 


its 
its 


complement , 
complement , 


GTGAAAACGACCCGG 
GTGAAAATGACCCGG 


(SEQ 
(SEQ 


ID 
ID 


NO:26) 
NO: 27) 


and 
and 


its 
its 


complement, 
complement. 


20 


GGCAAGCCCTGGGGC 
GGCAAGCTCTGGGGC 


(SEQ 
(SEQ 


ID 
ID 


NO:28) 
NO:29) 


and 
and 


its 
its 


complement, 
complement , 




GCCCTGGGGCTGGAT 
GCCCTGGAGCTGGAT 


(SEQ 
(SEQ 


ID 
ID 


NO: 30) 
NO: 31) 


and 
and 


its 
its 


complement, 
complement^ 


25 


ATAGCAAATCCCAGG 
ATAGCAATTCCCAGG 


(SEQ 
(SEQ 


ID 
ID 


NO: 32) 
NO: 33) 


and 
and 


its 
its 


complement, 
coifiplement. 


30 


GCTCTGCCCTAGGCA 
GCTCTGCACTAGGCA 


(SEQ 
(SEQ 


ID 
ID 


NO:34) 
NO: 35) 


and 
and 


its 
its 


complement, 
complement. 


CCCCCACCCCTCACA 
CCCCCACTCCTCACA 


(SEQ 
(SEQ 


ID 
ID 


NO: 36) 
NO:37) 


and 
and 


its 
its 


complement, 
complement. 


35 


TCCCTCCGCATCGCA 
TCCCTCCACATCGCA 


(SEQ 
(SEQ 


ID 
ID 


NO:38) 
NO:39) 


and 
and 


its 
its 


complement, 
complement. 




CACCTGCTGTGGTGT 
CACCTGCCGTGGTGT 


(SEQ 
(SEQ 


ID 
ID 


NO:40) 
NO: 41) 


and 
and 


its 
its 


complement, 
complemeint. 


40 


ATGTCTGAAGTAGAC 
ATGTCTGCAGTAGAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 42) 
NO: 43) 


• and 
and 


its 
its 


complement, 
complement , 


45 


TGACCAACCTTTGCT 
TGACCAACTTTTGCT 


(SEQ 
(SEQ. 


ID 
ID 


NO:44) 
NO: 45) 


and 
and 


its 
its 


complement , 
complement. 


■ CCTGTTTTCTGGAGC 
CCTGTTTCCTGGAGC 


(SEQ 
(SEQ 


ID 
ID 


NO: 46) 
NO: 47) 


and 
and 


its 
its 


complement, 
complement , 


50 


TGGACCTGCTCGGAG 
TGGACCTTCTCGGAG 


(SEQ 
(SEQ 


ID 
ID 


NO:48) 
NO: 49) 


and 
and 


its 
its 


complement, 
complement. 




AGTCATGCCTTCTTC 
AGTCATGTCTTCTTC 


(SEQ 
(SEQ 


ID 
ID 


NO:50) 
NO: 51) 


and 
and 


its 
its 


complement, 
complement. 


55 


GCCTTCTTCCACCTT 
GCCTTCTCCCACCTT 


(SEQ 
(SEQ 


ID 
ID 


NO: 52) 
NO: 53) 


and 
and 


its 
its 


complement, 
complement. 
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CAGCCCCCGTCTCGG 
CAGCCCCTGTCTCGG 


(SEQ 
(SEQ 


ID 
ID 


NO:54) 
NO: 55) 


and 
and 


its 
its 


complement, 
complement r 


5 


GGAGTTTGTACATGC 
GGAGTTTATACATGC 


(SEQ 
(SEQ 


ID 
ID 


NO: 5.6) 
NO: 5-7) 


and 
and 


its 
its 


complement, 
complement. 




CAGCTCCCCAGAGCA 
CAGCTCCTCAGAGCA 


(SEQ 
(SEQ 


ID 
ID 


NO: 58) 
NO: 59) 


and 
and 


its 
its 


complement, 
complement. 


10 


AGACAGGTCCTCGCC 
AGACAGGGCCTCGCC 


(SEQ 
(SEQ 


ID 
ID 


NO: 60) 
NO: 61) 


and 
and 


its 
its 


complement, 
complement , 




CTGCCCCTGGCAATG 
CTGCCCCCGGCAATG 


(SEQ 
(SEQ 


ID 
ID 


NO: 62) 
NO: 63) 


and 
and 


its 
its 


complement, 
complement, * 


AGGTGCATGTCCTCT 
AGGTGCACGTCCTCT 


(SEQ 
(SEQ 


ID. 
ID 


NO: 64) 
NO: 65) 


and 
and 


its 
its 


complement, 
complement. 


20 


GTGCATGTCCTCTTG 
GTGCATGCCCTCTTG 


(SEQ 
(SEQ 


ID 
ID 


NO: 66) 
NO: 67) 


and 
and 


its 

its 


complement, 
complement , 




GGCTTATCCATGCCT 
GGCTTATTCATGCCT 


(SEQ 
(SEQ 


ID 
ID 


NO: 68) 
NO: 69) 


and 
and 


its 
its 


complement, 
complement , 


25 


AGCCAGGCTGGCAGA 
AGCCAGGGTGGCAGA 


(SEQ 
(SEQ 


ID 
ID 


NO: 70) 
NO: 71) 


and 
and 


its 
its 


complement, 
complement. 


30 


GGCCCACATGGAGGC 
GGCCCACGTGGAGGC 


(SEQ 
(SEQ 


ID 
ID 


NO:72) 
NO: 73) 


and 
and 


its 
its 


complement, 
complement. 


' TAACACAGCCATCAA 
TAACACAACCATCAA 


(SEQ 
(SEQ 


ID 
ID 


NO:74) 
NO: 75) 


and 
and 


its 
its 


complement, 
complement. 


35 


TAATGCTCGTCTGTG 
TAATGCTTGTCTGTG 


(SEQ 
(SEQ 


ID 
ID 


NO:76) 
NO:77) 


and 
and 


its 
its 


complement, 
complement. 




ACTTGCCGTCTGGGT 
ACTTGCCATCTGGGT 


(SEQ 
(SEQ 


ID 
ID 


NO:78) 
NO: 79) 


and 
and 


its 
its 


complement, and 
complement. 



40 . 

An allele-specific oli^micleotide primer of the invention has a 3 ' terminal nucleotide, or 
preferably a 3" penultimate nucleotide^ that is complementary to only one nucleotide of a particular SNP, 
thereby acting as a primer for polym^rase-mediated extension only if the allele containing that nucleotide 
is present. Allele-specifie oligonucleotide primers hybridizing to either the coding or noncoding strand 

45 are contemplated by the invention. A preferred ASO primer for detecting IL4Ra-gene polymoiphisms 

comprises a nucleotide sequence, listed 5' to 3% selected from the group consisting of: 

CTGAGATTGCACCAC (SEQ ID NO: 80); GGCTGGAGTGCAGTG (SEQ ID NO:81); 
CTGAGATTGCACCGC (SEQ ID NO: 82); GGCTGGAGTGCAGCG (SEQ ID NO: 83); 

50 CTGTGCTTTTGTGCT (SEQ ID NO: 84); ACCAAGGGGAATAGC (SEQ ID NO: 85); 
CTGTGCTTTTGTGTT (SEQ ID NO:86) ; ACCAAGGGGAATAAC (SEQ ID NO: 87); 
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GAGTTCCTGGGCCGC 
GAGTTCCTGGGCCTC 


(SEQ 
(SEQ 


ID 
ID 


NO:88) ; 
NO: 90) ; 




GGAGCAGCCTGAGCG 
GGAGCAGCCTGAGAG 


(SEQ 
(SEQ 


ID NO:89) ; 
ID NO: 91) ; 


5 


TCCGAGTAAGCCTGC 
TCCGAGTAAGCCTCC 


(SEQ 
(SEQ 


ID 
ID 


NO: 92) ; 
NO: 94) ; 




TCCAGCTCCAGCGCA 
TCCAGCTCCAGCGGA 


(SEQ 
(SEQ 


ID NO: 93) ; 
ID NO: 95) ; 




TCCCTGAGAACAACG 
TCCCTGAGAACAATG 


(SEQ 
(SEQ 


ID 
ID 


NO: 96) ; 
NO: 98) ; 




ACCCCGCGCCTCCGT 
ACCCCGCGCCTCCAT 


(SEQ ID NO: 97) ; 
(SEQ ID NO: 99) ; 


10 


GAACAACGGAGGCGC 
GAACAACGGAGGCAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 100) 
NO:102) 


r 
r 


CACACGCACCCCGCG 
CACACGCACCCCGTG 


(SEQ 
(SEQ 


ID 
ID 


NO: 101) ; 
NO: 103) ; 


15 


TGGTCAGTGCGGATA 
TGGTCAGTGCGGACA 


(SEQ 
(SEQ 


ID 
ID 


N0:104) 
NO: 106) 


w 
r 


CCAGTGTATAGTTAT 
CCAGTGTATAGTTGT 


(SEQ 
(SEQ 


ID 
ID 


NO: 105) ; 
NO: 107) ; 


GCAGGGCGGAGTGCG 
GCAGGGCGGAGTGTG 


(SEQ 
(SEQ 


ID 
ID 


NO:108) 
NO:110) 


t 

7 


AGCCACCCCTGCCGC 
AGCCACCCCTGCCAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 109) ; 
NO: 111) ; 


20 


- ACAGCTGCCTGGGCT 
ACAGCTGCCTGGGTT 


(SEQ 
(SEQ 


ID 
ID 


N0:112) 
NO: 114) 


f 


CACCCCACCCTCAGC 
CACCCCACCCTCAAC 


(SEQ 
(SEQ 


ID 
ID 


NO:113) ; 
NO: 115) ; 




TGAGGGTGGGGTGGG 
TGAGGGTGGGGTGAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 116) ; 
NO: 118) ; 


CCTCCTCCCCrGCCC 
CCTCCTCCCCTGCTC 


(SEQ 
(SEQ 


ID 
ID 


NO: 117) ; 
NO: 119) ; 


25 


GGCCGCTTCTCCCGC 
GGCCGCTTCTCCCAC 


(SEQ 
(SEQ 


ID 
ID 


NO: 120) ; 
NO: 122) ; 


CTGGGTTTCACTGCG 
CTGGGtTTCACTGTG 


(SEQ 
(SEQ 


ID 
ID 


NO: 121) ; 
NO: 123) 




TTTGGAGTGAAAACG 
TTTGGAGTGAAAATG 


(SEQ 
(SEQ 


ID 
ID 


NO: 124) 
NO:126) 


f 

r 


CATCTGCCGGGTCGT 
CATCTGCCGGGTCAT 


(SEQ 
(SEQ 


ID 
ID 


NO: 125) ; 
NO: 127) ; 


CTGGGAGGCAAGCCC 
CTGGGAGGCAAGCTC 


(SEQ 
(SEQ 


ID 
ID 


NO:128) 
NO:130) 


r 
f 


TATCCAGCCCCAGGG 
TATCCAGCCCCAGAG 


(SEQ 
(SEQ 


ID 
ID 


NO:129) ; 
NO: 131) ; 


35 


AGGCAAGCCCTGGGG 
AGGCAAGGCCTGGAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 132) 
NO:134) 


r 
f 


TTTGCTATCCAGCCC 
TTTGCTATCCAGCTC 


(SEQ 
(SEQ 


ID 
ID 


NO: 133) ; 
NO: 135).; 




GGCTGGATAGCAAAT 
GGCTGGATAGCAATT 


(SEQ 
(SEQ 


ID 
ID 


NO:.136) 
NO: 138) 


/ 
e 


CTAGCTCCTGGGATT 
CTAGCTCCTGGGAAT 


(SEQ 
(SEQ 


ID 
ID 


NO:137) ; 
NO: 139) ; 


40 


CACCTGGCTCTGCCC 
CACCTGGCTCTGCAC 


(SEQ 
(SEQ 


ID 
ID 


NO:140) 
NO:142) 


/ 

• 
/ 


GGGACTTGCCTAGGG 
GGGACTTGCCTAGTG 


(SEQ 
(SEQ 


ID 
ID 


•N0:141) ; 
NO: 143) ; 


45 


CCTGGCCCCCCACCC 
CCTGGCCCCCCACTC 


(SEQ 
(SEQ 


ID 
ID 


NO: 144) 
NO:146) 


/ 

/ 


CTCTGATGTGAGGGG 
CTCTGATGTGAGGAG 


(SEQ 
(SEQ 


ID 
ID 


NO: 145) ; 
NO: 147) ; 


GAACCCTCCCTCCGC 
GAACCCTCCCTCCAC 


(SEQ- 
(SEQ 


ID 
ID 


NO:148) 
NO:150) 


r 


GCTGGCTGCGATGCG 
GCTGGCTGCGATGTG 


(SEQ 
(SEQ 


ID 
ID 


NO: 149) ; 
NO: 151) ; 


50 


TAGATACACCTGCTG 
TAGATACACCTGCCG 


(SEQ 
(SEQ 


ID 
ID 


NO: 152) 
NO: 154) 


# 


GCAGATACACCACAG 
GCAGATACACCACGG 


(SEQ 
(SEQ 


ID 
ID 


NO: 153) ; 
NO: 155) ; 




GAAGGCATGTCTGAA 
GAAGGCATGTCTGCA 


(SEQ 
(SEQ 


ID 
ID 


NO:156) 
NO: 158) 


# 

/ 


ATGGCTGTCTACTTC 
ATGGCTGTCTACTGC 


(SEQ 
(SEQ 


ID 
ID 


NO: 157) ; 
NO: 159) ; 


55 


GAACCCTGACCAACC 
GAACCCTGACC7VATC 


(SEQ 
(SEQ 


ID 
ID 


NO: 160) 
NO: 162) 


r 
r 


TGCAAAAGCAAAGGA 
TGCAAAAGCAAAGAA 


(SEQ 
(SEQ 


ID 
ID 


NO: 161) ; 
NO: 163) ; 
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TCTTGCCCTGTTTTC 


(SEQ 


ID 


NO:1643 




TGTTGTGCTCCAGAA 


(SEQ 


ID 


NO: 165) ; 




TCTTGCCCTGTTTCC 


(SEQ 


ID 


NO:166) 




TGTTGTGCTCCAGGA 


(SEQ 


ID 


NO: 167) ; 




TGTTCCTGGACCTGC 


(SEQ 


ID 


NO:168) 


/ 


TCTCCTCTCCGAGCA 


(SEQ 


ID 


NO: 169) ; 


5 


TGTTCCTGGACCTTC 


(SEQ 


ID 


NO: 170; 




TCTCCTCTCCGAGAA 


(SEQ 


ID 


NO: 171) ; 




TGGGGGAGTCATGCC 


{SEQ 


ID 


NO: 172) 


w 


AAGGTGGAAGAAGGC 


(SEQ 


ID NO: 173); 




TGGGGGAGTCATGTC 


(SEQ 


ID 


NO: 174) 




AAGGTGGAAGAAGAC 


(SEQ 


ID NO: 175) ; 


10 


AGTCATGCCTTCTTC 


(SEQ 


ID 


NO:176) 


r 


TTCCCGAAGGTGGAA 


(SEQ 


ID NO: 177) ; 




AGTCATGCCTTCTCC 


(SEQ 


ID 


NO: 178] 


/ 


TTCCCGAAGGTGGGA 


(SEQ 


ID NO: 179); 




CAGCTGCAGCCCCCG 


(SEQ 


ID 


NO:1&0) 


r 


TGGGGGCCGAGACGG 


(SEQ 


ID 


NO: 181); 


15 


CAGCTGCAGCCCCTG 


(SEQ 


ID 


NO:182) 


r. 


TGGGGGCCGAGACAG 


(SEQ 


ID NO: 183); 




CTATCAGGAGTTTGT 


(SEQ 


ID 


NO:184) 


r 


TCCACCGCATGTACA 


(SEQ 


ID 


NO: 185) ; 




CTATCAGGAGTTTAT 


(SEQ 


ID 


NO: 186) 


r 


TCCACCGCATGTATA 


(SEQ 


ID 


NO: 187) ;■ . 




CCCAAGCAGCTCCCC 


(SEQ 


ID 


NO: 188) 


i 


CCCAGGTGCTCTGGG 


(SEQ 


ID. NO: 189); 


20 


CCCAAGCAGCTCCTC 


(SEQ 


ID 


NO:190) 


} 


CCCAGGTGCTCTGAG 


(SEQ 


Tn 


vrr\« 1 Ql \ ■ 
JMvy • J.^JL / r 




• CTGTGGAGACAGGTC 


(SEQ 


ID 


NO: 192) 




GTAGGGGGCGAGGAC 


(SEQ 


ID 


NO: 193) ; 




CTGTGGAGACAGGGC 


(SEQ 


ID 


NO: 194) 


t 


GTAGGGGGCGAGGCC 


(SEQ 


ID 


NO: 195) ; 


25 


TCCATCCTGCCCCTG 


(SEQ 


ID 


NO: 196) 


} 


TCTGAGCATTGCCAG 


(SEQ 


ID 


NO: 197) ; 




TCCATCCTGCCCCCG 


(SEQ 


ID 


NO: 198) 


r 


TCTGAGCATTGCCGG 


(SEQ 


ID 


NO: 199) ; 




TCTCTTAGGTGCATG 


(SEQ 


ID 


NO:200) 


} 


GCAACAAGAGGACAT 


(SEQ 


ID 


NO: 201) ; 


30 


TCTCTTAGGTGCACG 


(SEQ 


ID 


N0:2P2) 


/ 


GCAACAAGAGGACGT 


(SEQ 


ID 


NO: 203) ; 




TCTTAGGTGCATGTC 


(SEQ- 


ID 


NO:204) 


} 


CAGCAACAAGAGGAC 


(SEQ 


ID 


NO: 205) ; 




TCTTAGGTGCATGCC 


(SEQ 


ID 


NO:206] 


r 


CAGC/yVCAAGAGGGC 


(SEQ 


ID 


NO: 207); 




GACTAGGGCTTATCC 


(SEQ 


ID 


NO: 208) 


f 


TTTCCCAGGCATGGA 


(SEQ 


ID 


NO: 209) ; 


35 


GACTAGGGCTTATTC 


(SEQ 


ID 


NO:210) 


} 


TTTCCCAGGCATGAA 


(SEQ 


ID 


N0:211) ; 




GAAGGCAGCCAGGCT 


(SEQ 


ID 


NO:212) 


/ 


TGGAAATCTGCCAGC 


(SEQ 


ID 


NO:2ia) ; 




GAAGGCAGCCAGGGT 


(SEQ 


ID 


NO:214) 


/ 


TGGAAATCTGCCACC 


(SEQ 


ID 


NO: 215) ; 


40 


GATCATGGCCCACAT 


(SEQ 


ID 


NO:216} 


f 


AGGTGGGCCTCCATG 


(SEQ 


ID 


NO:217) ; 




GATCATGGCCCACGT 


(SEQ 


ID 


NO:218] 


/ 


AGGTGGGCCTCCAGG 


(SEQ 


ID 


NO:219) ; 




AGAAACTAACACAGC 


(SEQ 


ID 


NO:220) 


r 


ATTCCCTTGATGGCT 


(SEQ 


ID 


NO: 221) ; 


45 


AGAAACTAACACAAC 


(SEQ 


ID 


NO: 222) 


r 


ATTCCCTTGATGGTT 


(SEQ 


ID 


NO: 223) ; 




GTTGAGTAATGCTCG 


(SEQ 


IP 


NO:224) 


i 


AAAACACACAGACGA 


(SEQ 


ID 


NO: 225) ; 




GTTGAGTAATGCTTG 


(SEQ 


ID 


NO: 226) 


* 
t 


AAAACACACAGACAA 


(SEQ 


ID 


NO: 227) ; 




TAAGAAACTTGCCGT 


(SEQ 


ID 


NO:228) 


7 


ACCCAAACCCAGACG 


(SEQ 


ID 


NO:229) ; 


50 


TAAGAAACTTGCCAT 


(SEQ 


ID 


NO:230) 


7 


and ACCCAAACCCAGATG 


[SEQ ID NO: 231) 



Other genotyping oligonucleotides of the invention hybridize to a target region located one to 
several nucleotides downstream of one of the novel polymoxphic sites identified herein. Such 
oligonucleotides are useful in polymerase-mediated primer extension methods for detecting one of the 
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novel polymorphisms described herein and therefore such genotyping oligonucleotides are referred to 
herein as '*primer-exteasion oligonucleotides". In a preferred mbodiment» the 3 '-terminus of a primer- 
extension oligonucleotide is a deoxynucleotide complementary to the nucleotide located immediately 
adjacent to the polymoiphic site. A particularly preferred oligonucleotide primer for detecting I L4 Ra 
5 . gene polymorphisms by primer extension terminates in a nucleotide sequence, listed 5' to 3', selected 
from the group consisting of: 





AGATTGCACC 


(SEQ 


ID 


NO: 232) ; 


TGGAGTGCAG 


(SEQ 


ID NO:233) 




TGCTTTTGTG 


(SEQ 


ID 


NO:234) ; 


AAGGGGAATA 


(SEQ 


ID NO:235) 




TTCCTGGGCC 


(SEQ 


ID 


NO;236) ; 


GGAGCCTGAG 


(SEQ 


ID NO:237) 


10 


GAGTAAGCCT 


{SEQ 


ID 


NO:238) ; 


AGCTCCAGCG 


(SEQ 


ID NO:239) 




CTGAGAACAA 


(SEQ 


ID 


NO:240) ; 


CCGCGCCTCC 


(SEQ 


ID NO:241) 




CAACGGAGGC 


• (SEQ 


ID 


NO:242) ; 


ACGCACCCCG 


(SEQ 


ID NO:243) 




TCAGTGCGGA 


(SEQ 


ID 


NO:244) ; 


GTGTATAGTT 


(SEQ 


ID NO:245) 




GGGCGGAGTG 


(SEQ 


ID 


NO:246) ; 


CACCCCTGCC 


(SEQ 


ID NO:247) 


15 


GCTGCCTGGG 


(SEQ 


ID 


NO:248) ; 


CCCACCCTCA 


(SEQ 


I.D NO:249) 




GGGTGGGGTG 


(SEQ 


ID 


NO:250) ; 


CCTCCCCTGC 


(SEQ 


ID NO: 251) 




CGCTTCTCCC 


(SEQ 


ID 


NO: 252) ; 


GGTtTCACTG 


(SEQ 


ID NO:253) 




GGAGTGAAAA 


(SEQ 


ID 


NO: 254) ; 


CTGCCGGGTC 


(SEQ 


ID NO:255) 




GGAGGCAAGC 


(SEQ 


ID 


NO: 256) ; 


CCAGCCCCAG 


(SEQ 


ID NO:257) 


20 


, CAAGCCCTGG 


(SEQ 


ID 


NO: 258) ; 


GCTATCCAGC 


(SEQ 


ID NO:259) 




TGGATAGCAA 


(SEQ 


ID 


NO: 260) ; 


GCTCCTGGGA 


(SEQ 


ID NO:261) 




CTGGCTCTGC 


(SEQ 


ID 


NO:262).; 


ACTTGCCTAG 


<SEQ 


ID NO: 263) 




GGCCCCCPAC 


(SEQ 


ID 


NO: 264) ; 


TGATGTGAGG 


(SEQ 


ID NO:265) 




CCCTCCCTCC 


(SEQ 


ID 


NO: 266) ; 


GGCTGCGATG 


(SEQ 


ID NO:267) 


25 


ATACACCTGC 


(SEQ 


ID 


NO: 268) ; 


GATACACCAC 


(SEQ 


ID NO:269) 




GGCATGTCTG 


(SEQ 


ID 


NO: 270) ; 


GCTGTCTACT 


(SEQ 


ID NO:271) 




CCCTGACCAA 


(SEQ. 


ID 


NO: 272) ; 


AAAAGCAAAG 


(SEQ 


ID N0:273) 




TGCCCtGTTT 


(SEQ 


ID 


NO: 274) ; 


TGTGCTCCAG 


(SEQ 


ID NO:275) 




TCCTGGACCT 


(SEQ 


ID 


NO: 276) ; 


CCTCTCCGAG 


(SEQ 


ID NO: 277) 


30 


GGGAGTCATG 


(SEQ 


ID 


NO: 278) ; 


GTGGAAGAAG 


(SEQ 


ID NO: 279) 




CATGCCTTCT 


(SEQ 


ID 


NO:280) ; 


CCGAAGGTGG 


(SEQ 


ID NO:281) 




CTGCAGCCCC 


(SEQ 


ID 


NO: 282) ; 


GGGCCGAGAC 


(SEQ 


ID NO:283) 




TCAGGAGTTT 


(SEQ 


ID 


NO:284) ; 


ACCGCATGTA . 


(SEQ 


ID NO:285) 




AAGCAGCTCC 


(SEQ 


ID 


NO:286) ; . 


AGGTGCTCTG 


(SEQ 


ID NO:287) 


35 


TGGAGACAGG 


(SEQ 


ID 


NO:288) ; 


GGGGGCGAGG 


(SEQ 


ID NO:289) 




ATCCTGCCCC 


(SEQ 


ID 


NO: 290) ; 


GAGCATTGCC 


(SEQ 


ID NO:291) 




CTTAGGTGCA 


(SEQ 


ID 


NO: 292) ; 


ACAAGAGGAC 


(SEQ 


ID NO:293) 
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TAGGTGCATG 


(SEQ 


ID 


NO: 294) ; 


CAACAAGAGG 


(SEQ 


ID 


NO:295) ; 


TAGGGCTTAT 


(SEQ 


ID 


NO: 296) ; 


CCCAGGCATG 


(SEQ 


ID 


NO:297) ; 


GGCAGCCAGG 


(SEQ 


ID 


NO: 298) ; 


AAATCTGCCA 


(SEQ 


ID 


NO:299) ; 


CATGGCCCAC 


(SEQ 


ID 


NO: 300) ; 


TGGGCCTCCA 


(SEQ 


ID 


NO: 301) ; 


AACTAACACA 


(SEQ 


ID 


NO: 302) ; 


CCCTTGATGG 


(SEQ 


ID 


NO: 303) ; 


GAGTAATGCT 


(SEQ 


ID 


NO:304) ; 


ACACACAGAC 


(SEQ 


ID 


NO:305) ; 


GAAACTTGCC 


(SEQ 


ID 


NO: 306) ; 


and CAAACCCAGA 


(SEQ 


ID 


NO: 307) . 



In some embodiments, a conq)osition contains two or more differently labeled genotyping 
1 0 oligonucleotides for simultaneously probing the identity of nucleotides at two or more polymorphic sites. 
It is also contemplated that primer compositions may contain two or niore sets of allele-specific prim^ 
pairs to allow simultaneous targ^ing and amplification of two or more regions containing a polymorphic 
site. 

IL4Ra genotyping oligonucleotides of the invaition may also be immobilized on or synthesized 

15 on a solid surface such as a microchip, bead, or glass slide (see, e.g., .WO 98/20020 and WO 98/20019). 
Such immobilized genotyping oligonucleotides may be used in a variety of polymorphism detection 
assays, including but not limited to probe hybridization and polymerase extension assays. Immobilized 
IL4Ra genotyping oligonucleotides of the invention may comprise an ordered array of oligonucleotides 
designed to rapidly screen a DNA sample for polymorphisms in multiple genes at the same time. 

20 In another embodiment, the invention provides a kit comprising at least two genotyping 

oligonucleotides packaged in separate containers. The kit may also contain other components such as 
hybridization buffer (where the oligonucleotides are to be used as a probe) packaged ia a sqparate 
container: Alternatively, where the oligonucleotides are to be used to amplify a target region, the kit may 
contain, packaged in separate containers, a polymerase and a reaction buffer optimized for primer 

25 extension mediated by the polymerase, such as PGR. 

The above described oligonucleotide compositions and kits are useful in methods for genotyping 
and/or haplotyping the IL4Ra gene in an individual. As used herein, the terms "IL4Ra genotype" and 
"IL4Ra haplotype" mean the genotype or haplotype contains the nucleotide pair or nucleotide, 
respectively, that is present at one or more of the novel polymorphic sites described herein and may 

30 optionally also include the nucleotide pair or nucleotide present at one or more additional polymorphic 
sites in the IL4Ra gene. The additional polymorphic sites may be currently known polymorphic sites or 
sites that are subsequently discovered. 

One ^bodiment of the genotyping method involves isolating from the individual a nucleic acid 
mixture comprising the two copies of the IL4Ra gene, or a fragment th^^f, that are present in the 

35 individual, and determining the identity of the nucleotide pair at one or more of the polymorphic sites 
selected jftom PSl, PS2, PS3, PS4, PS6, PS7, PS8, PS9, PSIO, PSll, PS12, PS13, PS14, PS15, PS16, 
PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS25, PS27, PS28, PS30, PS32, PS33, PS34. PS35. PS36. 
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PS37, PS38, PS39, PS40, PS41, PS44, and PS45 in th two copies to assign an IL4Ra g^otype to the 
individual. As will be readily understood by the skilled artisan, the two '^copies" of a ^ne in an 
individual may be the same allele or may be di£ferent alleles. In a preferred embodiment of the 
genotyping method, the identity of the nucleotide pair atone or more of the polymorphic sites selected 

• 5 from the group consisting ofPS5,PS24,PS26,PS29,PS31,PS42, and PS43 is also determined. In a 
particularly preferred embodiment, the genotyping method comprises determining the identity of the 
nucleotide pair at each of PS I -45. 

Typically, the nucleic acid mixture is isolated from a biological sample taken from the individual^ 
such as a blood sample or tissue sample. Suitable tissue samples include whole blood, semen saliva, 

1 0 tears, urine, fecal material, sweat, buccal, skin and hair. The nucleic acid mixture may be comprised of 
genomic DNA, mRNA, or cDNA and, in the latter two cases, the biological sample must be obtained 
. from an org^n in which the IL4Ra gene is expressed. Furth^more it wiU be understood by the skilled 
artisan that mRNA or cDNA preparations would not be used to detect polymorphisms located in introns 
or in 5' and 3' nontranscribed regions. If an IL4Rd gene fragment is isolated, it must contain the 

15 polymorphic-site(s) tobegenotyped. 

One embodiment of the haplotyping method comprises isolating from the individual a nucleic 
acid molecule containing only one of the two copies of the IL4Ra gene, or a fragment thereof, that is 
present in the individual and determining in that copy the identity of the nucleotide at one or more of the 
polymoiphic sites PSl, PS2, PS3, PS4, PS6, PS7, PS8, PS9, PSIO, PSl 1, PS12, PS13, PS14, PS15, PS16, 

20 PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS25, PS27, PS28, PS30, PS32, PS33, PS34, PS35, PS36, 
PS37, PS38, PS39, PS40, PS41, PS44, and PS45 in that copy to assign an IL4Ra haplotype to the 
individual. The nucleic acid may be isolated using any method capable of separating the two copies of 
the IL4Ra gene or fragment such as one of the methods described above for preparing IL4Ra isogenes, 
with targeted in vivo cloning being the preferred approach. As will be readily appreciated by those skilled 

25 in the art, any individual clone will only provide haplotype information on one of the two IL4Ra gene 
copies present in an individual. If haplotype information is desired for the individual's other copy, 
additional IL4Ra clones will need to be examined. Typically, at least five clones should be examined to * 
have more than a 90% probability of haplotyping both copies of the IL4Ra gene in an individual. In 
some embodiments, the haplotyping method also comprises identifying the nucleotide atone or more of 

30 the polymorphic sites PS5, PS24, PS26, PS29, PS3 1 , PS42, and PS43. In a particularly preferred 
embodiment, the nucleotide at each of PS 1 -45 is identified. 

In a preferred embodiment, an IL4Ra haplotype pair is determined for an individual by 
identiiying the phased sequence of nucleotides at one or more of the polymorphic sites selected from PSl, 
PS2, PS3, PS4, PS6, PS7, PS8, PS9, PSIO, PSl 1, PS12, PS13, PS14, PS15, PS16, PS17. PS18, PS19, 

35 PS20, PS21, PS22, PS23, PS25, PS27, PS28, PS30, PS32, PS33, PS34, PS35, PS36, PS37, PS38, PS39, 
PS40, PS41, PS44, and PS45 in each copy of the IL4Ra gene that is present in the individual. In a 
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particularly preferred embodiment, the haplotyping method comprises identifying the phased sequenc of 
nucleotides at each of PSl-45 in each copy of the IL4Ra gene. When haplotyping both copies of the 
gene, the identifying step is preferably performed with each copy of the gene being placed in separate 
containers. However, it is also envisioned that if the two copies are labeled with different tags, or are 
S otherwise separately distinguishable or identifiable, it could be possible in some cases to perform ft^ 
method in the same container. For example, if furst and second copies of the gene are labeled with 
different first and second fluorescent dyes, respectively, and an allele-specific oligonucleotide labeled 
with yet a third different fiuorescent dye is used to assay the polymorphic site(s), then detecting a 
combination of the first and third dyes would identify the polymorphism in the first gene copy while 
10 detecting a combination of the second and third dyes would identify the polymorphism in the second gene 
copy, 

Li both the g^otyping and haplotyping methods, the identity of a nucleotide (or nucleotide pair) 
at a polymorphic 5ite(s) may be determined by amplifying a target region(s) containing the polymorphic 
site($) directly firom one or both copies of the IL4Ra gene, or fiagment thereof, and the sequence of the 

15 amplified region(s) determined by conventional methods. It will be readily appreciated by the skilled* 

artisan that only one nucleotide will be detected at a polymorphic site in individuals who are homozygous 
at that site, while two different nucleotides will be detected if the individual is heterozygous for that site. 
The polymorphism may be identified directly, known as positive-type identification, or by inference, 
referred to as negative-type identification. For example, where a SNP is known to be guanine and . 

20 cytosine in a reference population, a site may be positively detennined to be either guanine or cytosine for 
an individual homozygous at that site, or both guanine and cytosine, if the individual is heterozygous at 
that site. Alternatively, the site may be negativ^y determined to be not guanine (and thus 
cytosine/cytosine) or not cytosine (and thus guanine/guanine). 

In addition, the identity of the allele(s) present at any of the novel polymoxphic sites described 

25 ' herein may be indirectly determined by genotyping a polymorphic site not disclosed herein that is in 
linkage disequilibrium with the polymoiphic site that is of interest. Two sites are said to be in linkage 
disequilibrium if the presence of a particular variant at one site enhances the predictability of another 
variant at the second site (Stevens, JC 1999, MoL Diag. 4: 309-17). Polymorphic sites in linkage 
disequilibrium, with the presently disclosed polymorphic sites may be located in regions of the gene or in 

30 other genomic regions not examined herein. Genotyping of a polymorphic site in linkage disequilibrium 
with the novel polymorphic sites described herein may be perfonned by, but is not limited to, any of the 
above-mentioned methods for detecting the identity of the allele at a polymorphic site. 

The target region(s) may be amplified using any oligonucleotide-directed amplification method, 
including but not limited to polymerase chain reaction (PGR) (U.S. Patent No. 4,965, 1 88), ligase chain 

35 reaction (LCR) (Barany et al., Proc. Natf. Acad. Sci. USA 88:189-193,^ 1991; WO90/01069), and 

oligonucleotide ligation assay (OLA) (Land grenetal., Science 241:1077-1080, 1988). Oligonucleotides 
usefiil as prim rs or probes in such methods should specifically hybridize to a region of the nucleic acid 
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tliat contains or is adjacent to the polymorphic site. Typically, the oligonucleotides are between 10 and 
35 nucleotides in length and preferably, between 15 and 30 nucleotides in length. Most preferably, the 
oligonucleotides are 20 to 25 nucleotides long. The exact length of the oligonucleotide will depend on 
many factors that are routinely considered and practiced by the skilled artisan. 
5 Other known nucleic acid amplification procedures may be used to amplify the target region 

including transcription-based amplification systems (U.S. Patent No, 5,130,238; EP 329,822; U.S. Patent 
No. 5,169,766, WO89/06700) and isothermal methods (Walker et al., Proc. NatL Acad ScL USA 89:392- 
396,1992). 

A polymorphism in the target region may also be assayed before or after amplification using one 
10 of several hybridization-based methods known in the art. Typically, allele-specific oligonucleotides are 
utilized in performing such methods. The allele-specific oligonucleotides may be used as differ^tly 
labeled probe pairs, with one member of the pair showing a perfect match to one variant of a target 
sequence and the other member showing a p^ect match to a difiTerent variant. In some embodiments, 
more than one polymorphic site may be detected at once usiiig a set of allele-specific oligonucleotides or 
15 oligonucleotide pairs. Preferably, the members of the set have melting temperatures within 5**C, and 
more preferably within 2**C, of each other when hybridizing to each of the polymorphic sites being 
detected. 

Hybridization of an allele-specific oligonucleotide to a target polynucleotide may be performed 
with both entities in solution, or such hybridization may be performed when either the oligonucleotide or 

20 the target polynucleotideis covalently or noncovalently affixed to a solid support* Attachment may be 
mediated, for example, by antibody-antigen interactions, poly-Lr-Lys, streptavidin or avidin-biotin, salt 
bridges, hydrophobic interactions, chemical linkages, UV onoss-linking baking, etc. Allele-specific 
oligoiiucleotides may be synthesized directiy on the solid support or attached to the solid support 
subsequent to synthesis. Solid-supports suitable for use in detection methods of the invention include 

25 substrates made of silicon, glass, plastic, paper and the like, which may be formed, for example, into 

wells (as in 96-well plates), slides, sheets, membranes, fibers, chips, dishes, and beads. The solid support 
may be treated, coated or derivatized to facilitate the immobilization of the allele-specific oligonucleotide 
or target nucleic acid. 

The genotype or haplotype for the IL4Ra gene of an individual may also be determined by 
30 hybridization of a nucleic sample containing one or both copies of the gene to nucleic acid arrays and 
subarrays such as described in WO 95/1 1995. The arrays woiild contain a battery of allele-specific 
oligonucleotides representing each of the polymorphic sites to be included in the genotype or haplotype. 

The idmtity of polymorphims may also be detennined using a mismatch detection technique, 
including but not limited to the RNase protection method using riboprobes (Winter et al., Proc. Natl. 
35 Acad. Sci. USA 82:7575, 1985; Meyers et al.. Science 230:1242, 1985) and proteins which recognize 
nucleotide mismatches, such as the E. coli mutS protein (Modrich, P. Ann. Rev. Genet. 25:229-253, 
1991). Alternatively, variant alleles can be identified by single strand conformation polymorphism 
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(SSCP) analysis (Oiita et al.. Genomics 5:874-879, 1989; Humphries et al., in Molecular Diagnosis of 
Genetic Diseases, R. Elles, ed., pp, 321-340, 1996) or denaturing gradient gel electrophoresis (DGGE) 
(Wartell et al., NucL Acids Res. 18:2699-2706, 1990; Sheffield et al., Pioc, Natl. Acad. Sci. USA 86:232- 
236, 1989). 

5 A polymerase-mediated primer extension method may also be used to identify the 

polymorphism(s). Several such methods have been described in the patent and scientific literature and 
include the "Genetic Bit Analysis" method (W092/15712) and the ligase/polymerase mediated genetic bit 
analysis (U.S. Patent 5,679,524. Related methods are disclosed in WO91/02087, WO90/09455, 
W095/17676, U.S. Patent Nos. 5,302,509, and 5,945,283. Extended primers containing a polymorphism 

10 may be detected by mass spectrometry as described in U.S. Patent No. 5,605,798. Anoth^ primer 
extension method is allele-specific PGR (Ruano et al., Nucl. Acids Res.- 17:8392, 1989; Ruano et al., 
Nucl. Adds Res. 19, 6877-6882, 1991; WO 93/22456; Turid et al., J. Clin. Invest. 95:1635-1641, 1995). 
In addition, multiple polymorphic sites may be investigated by simultaneously amplifying multiple 
regions of the nucleic acid using sets of allele-specific primers as described in Wallace et al. 

15 (WO89/10414). 

.In another aspect of the invention, an individual's IL4Ra haplotype pair is predicted fiom its 
IL4Ra genotype using information on haplotype pairs known to exist in a reference population. In its 
broadest embodiment, the haplotyping prediction method comprises identifying an IL4Ra genotype for 
the individual at two or more polymorphic sites selected fix)m PSl , PS2, PS3, PS4, PS6, PS7, PS8, PS9, 

20 PSIO, PSn, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS25, PS27, 
PS28, PS30, PS32, PS33, PS34, PS35, PS36, PS37, PS38, PS39, PS40, PS41 , PS44, and PS45, 
enumemting all possible haplotype pairs which are consistent with the genotype, accessing data 
containing IL4Ra haplotype pairs identified in a reference population, and assigning a Haplotype pair to 
the individual that is consistent with the data. In one embodiment, the reference haplotype pairs include 

25 the IL4Ra haplotype pairs shown in Table 4. 

Generally, the reference population should be composed of randomly-selected individuals 
representing the inajor ethnogeographic groups of the world. A preferred reference population for use in 
the methods of the present invention comprises an approximately equal number of individuals fiom 
Caucasian, African American, Asian and Hispanic-Latino population groups with the minimum numb^ 

30 of each group being chosen based on how rare a haplotype one wants to be guaranteed to see. For 

example, if one wants to have a q% chance of not missing a haplotype that exists in the population at a 
p% frequency of occurring in the reference population, the number of individuals (n) who must be 
sampled is given by 2n=log(l-q)/log(l-p) where p and q are expressed as fractions. A preferred reference 
population allows the detection of any haplotype whose frequency is at least 10% with about 99% 

35 certainty and comprises about 20 unrelated individuals fix»m each of the four population groups named 

above. A particularly preferred refer^ce population includes a 3-generation &niily representing one or 

more of the four population groups to serve as controls for checking quality of haplotypine orocedures. 
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la a preteired embodiment, the iiapiotype ixequency data Cor each ethnogeographic group is 
examined to determine whether it is consistent with Hardy-Weinberg equilibrium. Hardy-Wemberg 
equilibrium (D.L. Hartl et aL, Principles of Population Genomics, Sinauer Associates (Sunderland, MA), 
3"* Ed., 1997) postulates that the fiequency of finding the haplotype pair /T, / H2 is equal to 

Statistically significant difference-between the observed and expected haplotype fi^equencies could be due 
to one or more fectors including significant inbreeding in the population group, strong selective pressure 
on the gene, sampling bias, and/or errors in the genotyping process. If laige deviations fit>m Hardy- 
Weinberg equilibrium are observed in an ethnogeogmphic group, the number of individuals in that group 

10 can be increased to see if the deviation is due to a sampling bias. If a larger sample size does not reduce 
the difference between observed and expected haplotype pair firequencies, then one may wish to consider 
haplotyping the individual using a direct haplotyping method such as, for example, CLASPER System'** 
technology (U.S. Patent No. 5,866,404), SMD, or allele-specific long-range PGR (Michalotos-Beloin et 
aL, Nucleic Acids Res. 24:4841-4843, 1996). 

15 In one embodiment of this method for predicting an IL4Ra haplotype pair, the assigning step 

involves performing the following analysis. First, each of the possible haplotype pairs is compared to the 
haplotype pairs in the reference population. Generally, only one of the haplotype pairs in the reference 
population matches a possible haplotype pair and that pair is assigned to the individual. Occasionally, 
only one haplotype represented in the reference haplotype pairs is consist^t with a possible haplotype 

20 pair for an individual, and in such cases the individual is assigned a haplotype pair containing this known 
haplotype and a new haplotype derived by subtracting the known haplotype from the possible haplotype 
pair. In rare cases, either no haplotypes in the reference population are consistent with the possible 
haplotype pairs, or alternatively, multiple reference haplotype pairs are consistent with the possible 
haplotype pairs. In such cases, the individual is preferably haplotyped using a direct molecular 

25 haplotyping method such as, for example, CLASPER System^ technology (U.S. Patent No. 5,866,404), 
SMD, or allele-s^)ecific long-range PGR (NCichalotos-Beloin et al.. Nucleic Acids Res. 24:4841-4843, 
1996). 

The invention also provides a method for determining the fiiequency of an IL4Ra genotype or 
IL4Ra h^lotype in a population. The method comprises determining the genotype or the haplotype pair 

30 for the IL4Ra gene that is present in each member of the population, wherein the genotype or haplotype 
comprises the nucleotide pair or nucleotide detected at one or more of the polymorphic sites PS 1 , PS2, 
PS3, PS4, PS6, PS7, PS8, PS9, PSIO, PSl 1, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, 
PS21, PS22, PS23, PS25, PS27, PS28, PS30, PS32, PS33, PS34, PS35, PS36, PS37, PS38, PS39, PS40, . 
PS41 , PS44, and PS45 in the IL4Ra gene; and calculating the fi-equency any particular genotype or 

35 haplotype is foimd in the population. The population may be a reference population, a family population, 
a same sex population, a population group, a trait population (e.g., a group of individuals exhibiting a trait 
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' of interest such as a medical condition or response to a therapeutic treatment). 

In another aspect of the invention, frequency data for IL4Ra genotypes and/or haplotypes found 
in a reference population are used in a method for identifying an association between a trait and an IL4Ra 
genotype or an IL4Ra haplotype. The trait may be any detectable phenotype, including but not limited to 
5 susceptibility to a disease or response to a treatment. The method involves obtaining data on the 
frequency of the genotype(s) or haplotype(s) of interest in a reference population as well as in a 
population exhibiting the trait. Frequency data for one or both of the reference and trait populations may 
be obtained by genotyping or haplotyping each individual in the populations using one of the methods 
described above. The haplotypes for the trait population may be determiued directly or, alternatively, by 

10 the predictive genotype to haplotype approach described above. In another embodiment, the frequency 
data for the reference aiid/or trait populations is obtained by accessing previously determined frequency 
data, which may be in written or electronic form. For example, the frequency data may be present in a 
database that is accessible by a computer. Once the frequency data is obtained, the frequencies of the 
genotype(s) or haplotype(s) of interest in the reference and trait populations are compared. In a preferred 
* 15 embodiment, the fi^iiCTcies of all genotypes and/or haplotypes observed in. the populations are 
compared. If a particular genotype or h^lotype for the IL4Ra gene is more frequent in the trait 
population than in the retrace population at a statistically significant amount, then the trait is predicted 
to be associated with that IL4Ra genotype or haplotype. Preferably, the llARa genotype or hi^lotype 
being compared in the trait and reference populations is selected from the fiill-genotypes and fiiU- 

20 haplotypes shown in Tables 4 and 5, respectively, or from sub-genotypes and sub-haplotypes derived 
from these genotypes and haplotypes. 

In a preferred embodiment of the method, the trait of interest is a clinical response exhibited by a 
patient to some therapeutic treatment, for example, response to a drug targeting IL4Ra or response to a 
therapeutic treatment for a medical condition. As used herein, *'medical condition" includes but is not 

25 limited tq any condition or disease manifested as one or more phjrsical and/or psychological symptoms 
for which treatmrat is desirable, and includes previously and newly identified diseases and other 
disorders. As used herein the term "clinical response" means any or all of the following: a quantitative 
measure of the response, no response, and adverse re^xmse (i.e., side effects). 

In order to deduce a correlation between clinical response to a treatment and an IL4Ra genotype 

• 30 or haplotype, it is necessary to obtain data on the clinical responses exhibited by a population of 

individuals who received the treatment, hereinafter the "clinical population". This clinical data may be 
obtained by analyzing the resuUs of a clinical trial that has aheady been run and/or the clinical data may 
be obtained by designing and carrying out one or more new clinical trials. As used herein, the term 
"clinical trial" means any research study designed to collect clinical data on responses to a particular 

35 treatment, and includes but is not limited to phase I, phase II and phase UI clinical trials. Standard 
methods are used to define the patient population and to enroll subjects. 
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It is preferred that the individuals included in the clinical population have been graded for the 
existence of the medical condition of interest. This is important in cases where the symptom(s) .being 
presented by the patients can be caused by more than one underlying condition, and where treatment of 
the underlying conditions are not the same. An example of this would be where patients exp^ence 
5 breathing difficulties that are due to either asthma or respiratory infections. If both sets were treated with 
an asthma medication, there would be a spurious group of apparent non-responders that did not actually 
have asthma. These peojple would affect the ability to detect any correlation between haplotype and 
treatment outcome. This grading of potential patients could employ a standard physical exam or one or 
more lab tests. Alternatively, grading of patients could use haplotyping for situations where there is a 
10 strong correlation between haplotype pair and disease susceptibility or severity. 

The therapeutic treatment of interest is administered to each individual in the tried population and 
' each individual's lespoiise to the treatment is measured using one or more predetermined criteri It is 
contemplated that in many cases, the trial population wOl exhibit a range of responses and that the 
investigator will choose the number of responder groups (e.g., low, medium, hig)i).made up by the 
15 various responses. In addition, the IL4Ra gene for each individual in the trial population is genotyped 
and/or haplotyped, which may be done before or after administering the treatment. 

After both the clinical and polymorphism data have been obtained, correlations between 
individual response and IL4Ra genotype or haplotype content are created. Correlations may be produced 
in several ways. In one method, individuals are grouped by their IL4Ra genotype or haplotype (or 
20 haplotype pair) (also referred to as a polymorphism group), and then the averages and standard deviations 
of clinical responses exhibited by the members of each polymorphism group are calculated. 

These results are then analyzed to determine if any observed variation in clinical response 
between polymorphism groups is statistically significant Statistical analysis methods which may be used 
are described in L.D.. Fisher and G. vanBelle, ''Biostatistics: A Methodology for the Health Sciences", 
25 Wil^-Interscience (New York) 1993. This analysis may also include a regression calculation of which 
polymorphic sites in the IL4Ra gene give the most significant contribution to the differences in 
phenotype. One regression model useful in the invention starts with a model of the form 

where r is the response, is a constant called fhc ^intercept", S is the slope and d is the dose. To 

30 determine the dose, the most-common and least conunon nucleotides at the polymorphic site are first 

defined. Then, for each individual in the trial population, one calculates a "dose" as the number of least- 
common nucleotides the individual has at the polymorphic site of interest. This value can be 0 
(homozygous for the least-common nucleotide), 1 (heterozygous), or 2 (homozygous for the most 
common nucleotide). An individual's '"response" is the value of the clinical measiuement. Standard 

35 linear regression methods are then used to fit all the individuals' doses and responses to a stagle model 
(see e.g., L.D. Fisher and G. vanBelle, supm, Ch 9). The outputs of the regression calculation are the 
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intercept » the slope S, and the variance (which measures how well the data fits this simple linear 

model). The Students t-test value and the level of significance can then be calculated for each of the 
polymorphic sites. 

A second method for finding correlations between IL4Ra haplotype content and clinical 
responses uses predictive models based on emor-minimizing optimization algorithms. One of many 
possible optimization algorithms is a genetic algorithm (R, Judson, '^Genetic Algorithms and Their Uses 
in Chemistry*' in Reviews in Computational Chemistry, Vol. 10, pp. 1-73, K. B. Lipkowitz and D. B. 
Boyd, eds, (VCH Publishers, New York, 1997). Simulated annealing (Press et al., '*Numerical Recipes in 
C: The Art of Scientific Computing", Cambridge University Press (Cambridge) 1992, Ch. 10), neural 
networks (E. Rich and Knight, "Artificial Intelligence", 2™^ Edition (McGraw-Hill, New York, 1991, 
Ch. 18), standard gradient descent methods (Press et al., supra Ch. 10), or other global or local 
optimization approaches (see discussion in Judson, supra) could also be used. As an example, a genetic 
algoritimi approach is described herein. This method searches for optimai parameters or weights in linear 
or noii-lineair models connecting IL4Ra'hdplotype loci and clinical outcome. One model is of the form 

where C is the measured plinical outcome, i gqes over all polymoiphic sites, a over all candidate genes, 
Cq, Wg^ and w^^ are variable weight values, i?,^ is equal to 1 if site i in gene a in the first haplotype 

takes on the most common nucleotide and —1 if it takes on the less common nucleotide. L^^ is the same 

as Rf^ except for the second haplotype. The constant term Cq and the weights w^^ and iv!^ are varied 

by the genetic algorithm during a search process that minimizes the error between the measured value of 
C and the value calculated firom Equation 1 . Models otha: than the one given in Equation 1 can be readily 
incorporated by those skilled in the art for analyzing the clinical and polymorphism data. The genetic 
algorithm is especially suited for searching not only over the space of weights in a particular model but 
also over the space of possible models (Judson, supra). . 

Correlationis may also be analyzed using analysis of variation (ANOVA) techniques to determine 
how much of the variation in the clinical data is explained by different subsets of the polymorphic sites in' 
the IL4Ra gene. ANOVA is used to test hypotheses about whether a respot^e variable is caused by or 
correlated with one or more traits or variables that can be measured (Fisher and vanBelle, supra^ Ch. 10). 
These traits or variables are called the independent variables. To carry out ANOVA, the independent 
variable(s) are measured and individuals are placed into groups based on their values for these variables. 
In this case, the independent variable(5) refers to the combination of polymorphisms present at a subset of 
the poljrmorphic sites, and thus, each group contains those individuals with a given genotype or haplotype 
pair. The variation in response within the groups and also the variation between groups is then measured. 
If the within-group response variation is large (people in a group have a wide range of responses) and the 
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response variation between groups is small (the average responses tor all groups are about the same) then 
it can be concluded that the independent variables used for the grouping are not causing or correlated with 
the response variable. For instance, if people are grouped by month of birth (which should have nothing 
to do with their response to a drug) the ANOVA calculation should show a low level of significance. 
5 However, if the response variation is larger between groups than within groups, the F-ratio (='*between 
groups" divided by **witbin groups'*) is greater than one. Large values of the F-ratio indicate that the 
independent variable is causing or correlated with the response. The calculated F-ratio is preferably 
compared with the critical F-distribution value at whatever level of significance is of interest. If the F- 
ratio is greater than the Critical F-distribution value, then one may be confident that the individual's 

10 genotype or haplotype pair for this particular subset of polymorphic sites in the IL4Ra gene is at least 
partially responsible for, or is at least strongly correlated with the clinical response. 

From the ainalyses described above, a mathematical model may be readily constructed by the 
skilled artisan that predicts clinical response as a function of IL4Ra genotype or haplotype content. 
Preferably, the model.is validated in one or more follow-up clinical trials designed to test the model. 

IS The identification of an association between a clinical response and a genotype or h^lotype (or 

haplotype pair) for the IL4Ra g^e may be the basis for designing a diagnostic method to determine those 
individuals who will or wUl not respond to the treatrnent, or alternatively, will respond at a lower level 
and thus may require more treatment, i.e., a greater dose of a drug- The diagnostic method may take one 
of several forms: for example, a direct DNA test (i.e., genotyping or haplotyping one or more of the 

20 polymorphic sites in the IL4Ra gene), a serological test, or a physical exam measm^ment. The only 
requirement is that there be a good correlation between the diagnostic test results and the imderlying 
IL4Ra genotype or haplotype that is in jtum correlated with the clinical response. In a preferred 
embodiment, this diagnostic method uses the predictive hs^lotyping method described above. 

Any or all analytical and matheiiiatical operations involved in practicing the methods of the 

25 present invmtion may be implemented by a computer. In addition, the computer may execute a program 
that generates views (or screens) displayed on a display device and with which the user can interact to 
view and analyze large amounts of information relating to the IL4Ra gene and its. genopuc variation, 
including chromosome location, gene structure, and gene family, gene expression data, polymorphism 
data, genetic sequence data, and clinical data population data (e.g., data on ethnogeographic origin, 

30 clinical responses, genot5^es, and haplotypes for one or more populations). The IL4Ra polymorphism 
data described herein may be stored as part of a relational database (e.g., an instance of an Oracle . 
database or a set of ASCII flat files). These pol^miorphism data may be stored on the computer*s hard 
drive or may, for example, be stored on a CD ROM or on one or more other storage devices accessible by 
the computer. For example, the data may be stored on one or more databases in comniunication with the 

35 computer via a network. 

Preferred embodiments of the invention are described in the following examples. Other 
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embodiments wittain the scope ot me ciamis tieran will be apparent to one skilled in the art from - 
consideration of the specification or practice of the invention as disclosed herein. It is intended that the 
specification, together with the-examples> be considered exemplary only» with tiie scope and spirit of the 
invention being indicated by the claims which foUow the example. 

5 

EXAMPLES 

The Examples herein are meant to exemplify the various aspects of canying out the invention and -* 
are not intended to limit the scope of the invention in any way. The Examples do not include detailed 
- descriptions for conventional methods employed, such as in the performance of genomic DNA isolation, 
10 PGR and sequencing procedures. Such methods are weU-known to those skilled in the art and are 

described in numerous publications, for example, Sambrook, Fritsch, and M aniatis, "Molecular Cloning: 
A Laboratory Manual", 2"** Edition, Cold Spring Harbor Laboratory Press, USA, (1989). 



15 - Example 1 A 

This example illustrates examination of various regions of the IL4Ra gene for polymorphic sites 
using DNA from Index Repository lA. 



Amplification of Target Regions 

20 The following target regions of the IL4Ra gene were amplified using the PCR primer pairs listed below, 

with the sequ^ces presented in the 5 ' to 3 ' direction and nucleotide positions shown for each region 

corresponding to the indicated GenBank Accession No. 

Accession Number AC004525 
Fragment 1 
25 Forward Primer 

32801-32822 CCACAGTCATCCCGACACTAGC (SEQ ID NO:308) 
Reverse Primer 

Complement of 33355-33334 TATTCCAGCCGTATCCATGTGC (SEQ ID NO:309) 
PCR product 555nt . . 

30 

Fragment 2 
Forward Primer 

35515-35536 CCTTGGTGCATGTGGTAAGAGG (SEQIDNO:310) 
Reverse Primer • 

35 Complement of 36068-36046 TTTCAAAGGTGGGAGGACTGAGG (SEQ ID NO:31 1) 
PCR product 554 nt 

Fragment 3 
Forward Primer 

40 37031-37050 GCAGTGAGCTGGGATTGTGC (SEQ ID NO:312) 
Reverse Primer 

Complement of 37701-37679 AACTCCCCTTCTCTGATGTGAGG (SEQ ID NO:313) 
PCR product 671 nt 
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Fragment 4 
Forward Primer 

43240-43262 TCACAGTTACAGAGGTGGCAAGC (SEQ ID NO:3 14) 
Reverse Primer 

5 Complement pf 43727-43706 CTGCCTACCTGGCAGATACACC (SEQ ID NO:315) 
PCR product 488 nt 

Fragments 
Forward Primer 

10 49553-49574. AGCTGTCACTCCACCTCCTTGG (SEQ IDNO:316) 
Reverse Primer 

Complement of 50036-50013 AAAGCCTCTGGTCTGCTAATGACC (SEQ ID NO:317) 
PCR product 484 nt 

15 Fragment 6 

Forward Primer 

51394-51415 GGGAGGAGATTCAGAGCACTCC (SEQ ID NO:318) 
Reverse Primer 

Complementof 51847-51826 CAGTCCACGTTTCCAGAACACC (SEQIDNO:319) 
20 PCR product 454 nt 

Fragment 7 
Forward Primer 

52806-52826 GGCTTGGGATAATGGTGTTGC (SEQ ID NO:320) 
25 Reverse Primer 

Complement of 53529-53507 TACTTCCCGAAGGTGGAAGAAGG (SEQ ID NO:321) 
PCRproduct 724nt 

Fragment 8 
30 Forward Primer 

53242-53265 CAGTGGAGATCAGCAAGACAGTCC (SEQ ID NO:322) 
Reverse Primer 

Complement of 53807-53786 GGGCATCTCGGGTTCTACTTCC (SEQ ID NO:323) 
PCRproduct 566 nt r 

35 . 

Fragment 9 
Forward Prim^ 

53522-53544 GGGAAGTACGAGTGCTCACATGC (SEQ ID NO:324) 
Reverse Primer 

40 Complement of 541 10-54Q88 CrTATACCCCTClTCCCCACTGC (SEQ ID NO:325) 
PCRproduct 589 nt 

Fragment 10 • 
Forward Primer 

45 53821-53841 TCTCTGAGCCAACCACTGTGC (SEQ ID NO:326) 
Reverse Primer 

Complement of 54359-54337 GGCTGAGTAGACAATGCCACTGC (SEQ ID NO:327) 
PCRproduct 539 nt 

50 Fragment 1 1 

Forward Primer 

54055-54076 CTGTGTCCCCAGAGAAATGTGG (SEQ ID NO:328) 
Reverse Primer 

Complement of 54717-54695 GACTCAGCAACAAGAGGACATGC (SEQ ID NO:329) 
55 PCRproduct 663 nt 
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Fragment 12 
Forward Primer 

54342-54365 GjGCATTGTCTACTCAGGCCTTACC (SEQ ID NO:330) 
Reverse Primer 

5 Complement of 54986-54967 ACAAGTCGAGGTGCCCAAGG (SEQ ID NO:331) 
PGR product 645 nt , 

Fragment 13 
Forward Primer 

1 0 54669-54693 CCCACATACATGAGGGTCTCTTAGG (SEQ ID NO:332) 
Reverse Primer 

Complement of 55270-55250 ATTCTGCCTCCAGCATCAACC (SEQ ID NO:333) 
FCR product 602 nt 

Fragment 14 
Forward Primer 

55235-55258 AACAGAGCTTCCTTAGGTTGATGC (SEQ ID NO:334) 
Reverse Primer 

Complement of 55847-55825 CCTCAGTTCCCCACTACCTTAGC (SEQ ID NO:335) 
PGR product 613 nt 

These primer pairs were used in PGR reactions containing genomic DNA isolated from 

immortalized cell lines for each member of Index Repository lA. The PGR reactions w^re carried out 

under the following conditions: 

25 Reaction volimie 

10 X Advantage 2 Polymerase reaction buffer (Clontech) 
100 ng of human genomic DNA 
lOmMdNTP 

Advantage 2 Polymerase ^izyme mix (Clontech) 
30 Forward Primer (10 jiM) 
Reverse Primer (10 |iM) 
Water 

. Amplification profile: 
35 94"C--2min. 1 cycle 

94**C-30sec. -| 
70*'C-45sec. L 10 cycles 

72**C - 1 min. | 



94''C - 30 sec. :) 
64**C-.45sec. L 35 cycles 

72**C-lmin. J 

45 Sequencing of PGR Products 

The PGR products were purified by Solid Phase Rev^ible Immobilization using the protocol 
developed by the Whitehead Genome Center, A detailed protocol can be found at 
http://www.genome.wi.mit.edu/sequencing/protocols/pure/SPRI_pcr.html. 

Briefly, five \il of carboxyl coated magnetic beads (10 mg/ml) and 60 \d of HYB BUFFER (2.5M 
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NaCl/20% PEG 8000) were add^d to each PCK reaction mixture (20 jxl). I'be reaction mixture was 
mixed well and incubated at room temperature (RT) for 10 min. The micrptitre plate was placed on a 
magnet for 2 min and the beads washed twice with 150 |uQ of 70% EtOH. The beads were air dried for 2 
tnin and the DNA was eluted in 25 ^ of distilled water and incubated at RT for S min. The beads were 
5 magnetically separated and the supernatant removed for testing and sequencing. 

The purified PGR products were sequenced in both directions using the primer sets described 
previously or those listed, in the 5 ' to 3 ' direction, below. 

Accession Number: AC004525 
Fragment 1 
10 Forward Primer 

32865-32882 GCGCTGGCCCTCAACTTT (SEQ ID NO:336) 
Reverse Primer 

Complement of 33283-33264 GTCCCTGGAGATGGGACCTC (SEQ ID NO:337) 

15 Fragment 2 

Forward Primer 

35598-35617 GCCCCCAGATCTGTCCTCAC (SEQ ID NO:338) - 
Reverse Primer 

Complement of 36013-35994 GGAAAATACAGGCGGCTTGC (SEQ ID NO:339) 

20 

Fragment 3 
Forward Primer 

37182-37203 GGCTCTGAATCTGTGTGGTGCT (SEQ ID NO:340) 
Reverse Primer 

25 Complement of 37639-37620 AGCCAGGTGAGAAGCCAGGT (SEQ ID NO:341) 

Fragment 4 
Forward Primer 

43266-43285 GGCCTGAACAGGACGAACAA (SEQ ID NO;342) 
30 Reverse Primer 

Complement of 43687-43668 GGCAGGATTGCCATTAG AGG (SEQ ID NO:343) 

Fragment 5 
Forward Primer 

35 49639-49660 TGAGTCAGTGGTTTGACCTCCA (SEQ ID NO:344) 
Reverse Primer 

Complement of 49999-49980 GCCTCTGTCTCCCCTGCAAC (SEQ ID NO:345) . 

Fragment 6 
40 Forward Primer 

51423-51442 CCACTTTTGCCATCGACCAC (SEQ ID NO:346) 
Reverse Primer 

Complement of 51818-51799 CTGCCGTCCCTTGAAGGCTA (SEQ ID NO:347) 

45 Fragment 7 

Forward Primer 

52932-52953 CCCTACCCTCAGGGATTTCTCA (SEQ ID NO:348) 
Reverse Primer 

Complement of 53474-53455 CCCATTCTCCTCTCCGAGCA (SEQ ID NO:349) 

50 
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Fragment 8 
Forward Primer 

53280-53299 ATCAGCGTGGTGCGATGTGT (SEQ ID NO:350) 
Reverse Primer 

5 Complement of 53756-53737 ACCCAGCTCTCTGGGACACG (SEQ ID NO:35 1) 

Fragment 9 
Forward Primer 

53548-53568 GGGATGAGTTCCCAAGTGCAG (SEQ ID NO:352) 
10 Reverse Primer 

Complement of 54048-54029 TGGCAAGCAGGCTTGAGAAG (SEQ ID NO:353) 

Fragment 10 • 
Forward Primer 

15 53841^53860 CCCCAACCTGAGCCAGAAAC (SEQ ID NO:354) 
Reverse Primer 

Complement of 54330-5431 1 TGTCCACAAGGGGGTCTGTG (SEQ ID NO:355) 

Fragment 1 1 
20 Forward Primer 

54083-54102 GGCTAGCAGTGGGGAAGAGG (SEQ ID NO:356) 
Reverse Primer 

Complement of 54617-54600 AtTCJCCAGGGGCAGGATG (SEQ ID NO:357) 

25 Fragment 12 

Forward Primer 

54422-54441 CCCTGTCATGGCCAGTCCIT (SEQ ID NU:358J 
Reverse Primer 

Complement of 54929-54910 GCGACCCAGTGCCCTCTACT (SEQ ID NO:359) 

30 

Fragment 13 
Forward Primer 

54694-54715 TGCATGTCCTCTTGTTGCTGAG (SEQ ID NO:360) 
Reverse Primer 

35 Complement of 55213-55194 CAATGACCACCCTCCCTGAA (SEQ ID NO:361) 

Fragm^t 14 
* Forward Primer 
55273-5529 1 CGGCTGTCAAGGGGTGTTC (SEQ ID NO:362) 
40 Reverse Primer 

Complement of 55769-55750 CCAAACCCAGACGGCAAGTT (SEQ ID NO:363) 



45 Analysis of Sequences for Polvmorohic Sites 

Sequences were analyzed for the presence of polymorphisms using the Polyphred program 
(Nickerson et al., Nucleic Acids Res. 14:2745-2751, 1997). The presence of a polymorphism was 
confirmed on both strands. :The polymorphisms and their locations in the IL4Ra gene are listed in Table 
3 below. 

50 Example IB 

This example illustrates examination of the IL4Ra gene for polymorphic sites in the following 
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targiet regions: lOUU base pairs upstream ot the Al'Ci start codon; each of the exons, including 
approximately 100 base pairs on either side of the exon; and 500-1000 base pans downstream of the 
termination codon. 



5 Amplification of the Target Regions 

PGR primers, which were designed based on the nearly complete IL4R genomic sequence 

reported in the GenBank database (Accession No: AC004525), are set forth below: 

Promoter 
Forward Primer 

10 30094-301 17 AAACCCCTGGACTCCAAGTGATCC (SEQ ID NO:364) 
Reverse Primer 

Complement of 30842,-30819 AAGCGATTCTTCTGCCTCAGCCTC (SEQ ID NO:365) 
PGR product 749 nt 

15 Exonl 

Forward Primer 

305 17-30540 GGACAGTTGTTGTGTAGCTCACCC (SEQ ID NO:366) 
Reverse Primer 

Complement of 31409-31431 CTATGTTGCCCAAGCTGACCTC (SEQ ID NO:367) 
20 PGR product 893 nt 

Exon 2 

Forward Primer 

32871-32890 GCCCTCAACTTTGCCTGCAC (SEQ ID NO:368) 
25 Reverse Primer 

Compl«nent of 33362-33340 AGTCCAGTATTCCAGCCGTATCG (SEQ ID NO:369) 
PCR product- 492 nt 

Exon 3 
30 Forward Prim©: 

35623-35644 TGATCGGGAAGCTGGAAGAGTC (SEQ ID NO:370) 
Reverse Primer 

Complement of 36070-36049 CGTTTCAAAGGTGGGAGGAGTG (SEQ ID NO:371) 
PCR product 448 nt 

35 

Exon4 

Forward Primer 

37230-37250 CGACCAAAAATCTGGGTGGTG (SEQ ID NO:372) 
Reverse Primer 

40 Complement of 37668-37648 CAGGAAGCAAAGGKJACTTGCC (SEQ ID NO:373) 
PCR product 439 nt 

Exon 5 

Forward Primer 

45 43307-43328 TCTTAAACATGGTGGGGTCAGC (SEQ ID NO:374) 
Reverse Primer 

Complement of 43765-43745 CATGKJAAATTGtGGGCTTGTG (SEQ ID NO:375) 
PCR product 459 nt 

50 Exon 6 

Forward Primer 

46507-46528 ATGTCKIAAGAGGGAGAGTGGTG (SEQ ID NO:376) 
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Reverse Primer 

Complem nt of 46870-46849 TGACTGAGAGGACTGCAAAGGG (SEQ ID NO:377) 
PGR product 364 nt 

5 Exon 7 

Forward Primer 

49672-49695 GCCTGATCTCTGATGCCAAATAAG (SEQ ID NO:378) . 
Reverse Primer 

Complement of 49964-49983 TTTGCCATTCCAGAAGCCAG (SEQ ID NO:379) 
10 PGR product 293 nt 

Exon 8 

Forward Primer 

51524^51546 GATCTGTGTGATGTCGfAGGCTTG (SEQ ID Nb:380) 
15 Reverse Primer 

Complement of 51845-51825 GTCCACGTITCCAGAACACCC (SEQ ID NO:381) 
PGR product 322 nt 

Exon 9 Fragment 1 
20 Forward Primer 

52912-52933 CGAAATCGCAAAGACACAGACC (SEQ ID NO:382) 
Reverse Primer 

Complement of 53722-53701 GAGTTGCTGAAGCTGCGGTAAG (SEQ ID NO:383) 
PGR product 811nt 

25 

. Exon 9 Fragment 2 
Forward Primer 

53352-53374 GAAAAAGGGAGCTTGTGTGCATC (SEQ ID NO:384) 
Reverse Primer 

30 Complement of 54172^54153 AACAAGGGGACAGGGACTGG (SEQ ID NO:385) 
PCR product 821 nt 

Exon 9 Fragment 3 
Forward Primer 

35 53854-53874 GAGAAACCTGGGAGCAGATGC (SEQ ID NO:386) 
Reverse Primer 

Complement of 547 1 1 -54688 GCAAGAAGAGGACATGGACCTAAG (SEQ ID NO:387) 
PCR product 858 nt 

40 Exon 9 Fmgment 4 
Forward Primer 

54266-54288 AAAGGTAGAGGACATGGGAAAGC (SEQ ID NO:388) 
Reverse Primer 

Complement of 55007-54987 GGAGCAGGGAAGAAGTCGTTC (SEQ ID NO:389) 
45 PCR product 742 nt 

These primer pairs were used in PCR reactions containing genomic DNA isolated from 

immortalized cell lines for a reference population of 70 human individuals. The PCR reactions were 

carried out under the following conditions: 

50 Reaction volume =20|il 

10 X Advantage 2 Polymerase reaction buffer (Glontech) - 2|il 

100 ngofhuman genomic DNA =1^1 

lOmMdNTT = 0.4^1 
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Advantage 2 Polymerase enzyme mix (Clontech) = 0.2|il 

Forward Primer (10 |iM) — 0.4 p.1 

Reverse Primer (10 (jM) = 0.4 |il 

Water =15.6|nl 

5 

Amplification profile: 
94**C-2min. 1 cycle 

94**C - 30 sec. 
10 70''C-45sec. L 10 cycles 



72**C - 1 min. 



} 



15 94°C - 30 sec. 

64'*C-45sec. i 35 cycles 



72"C-lmin. 



Se quencing of PGR Products 

• The PGR products were purified by Solid Phase Reversible Immobilization using the protocol 
20 . developed by the Whitehead Genome Center. A detailed protocol can be fovmd at 
http://www.genome.wi,mit.edu/sequencing/protocols/pure/SPRIjpcr.htmL 

Briefly, five of carboxyl coated magnetic beads (10 mg/ml) and 60 fil of HYB BUFFER (2.5M 
NaCl/20% PEG 8000) were added to each PGR reaction mixture (20 |il). The reaction mixture was 
mixed well and incubated at room temperature (RT) for 10 min. The microtitre plate was placed on a 
25 niagnetfor2minandthebeads washed twice with 150^ of 70% EtOH. The beads were air dried for 2 
min and the DNA was eluted in 25 ^1 of distilled vi^er and incubated at RT for 5 min. The beads were 
magnetically separated and the supernatant removed for testing and sequencing* 

The purified PGR products were sequenced in both directions using the primer sets described 
previously or those listed, in the 5 ' to 3 ' direction, below. 

30 

Promoter Fragment 1 
Forward Primer 

303 12-30334 GGTCAtTTAATCCCCACAACACG (SEQ ID NO:390) 
Reverse.Primer 

35 Gomplement of 30791-30769 GGAGCAGACGTOGGTAATTnTG (SEQ ID NO:391) 

Promoter Fragment 2 
Forward Primer 

30529-30551 TGTAGGTGACCCTGTGGACTTTG (SEQ ID NO:392) 
40 Reverse Primer 

Gomplement of 30990-3097 1 AATATGGAAGCCTGCGGTGG (SEQ ID NO:393) 

Exon 1 

Forward Primer 

45 30824-30846 TGAGGGAGAAGAATGGGTTGAAG (SEQ ID NO:394) 
Reverse Primer 

Gomplement of 31261-31240 AGTTGTCATTGGGTGTGCGGTG (SEQ ID NO:395) 
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£xon2 

Forward Primer 

32880-32900 TITCK^CTGCACnXSTGCTTTTG (SEQ ID NO:396) 
5 Reverse Primer 

Complement of 33234-33212 CCATACTCAGCATCCTGCACTCC (SEQ ID NO:397) 
Exon 5 

Forward Primer 

10 43330-43349 AACGACAGCAACCAGGGTGG (SEQ ID NO:398) 
Reverse Primer 

Complement of 43704-43682 CAGCAGGTGTATCTAATGGCAGG (SEQ ID NO:399) 

Exon 6 
1 5 Forward Primer 

46520-46541 AGAGTGGTGGGGAGATGAGGTG (SEQ ID NO:400) 
Reverse Primer 

Complement of 46857-46837 TGCAAAGGGGCAGACTAGAGG (SEQ ID NO:401) 

20 Exon 7 

Forward Primer 

49708r49729 CGACCACTTTTATGGGAGGAGC (SEQ ID NO:402) 
Reverse Primer 

Complement of 49927-49905 CCAGGTGTTCTGAACCACACTTC (SEQ ID NO:403) 

25 

Exon 8 

Forward Primer 

51528-51550 TGTGTGATGTCGAGGCTTGTACC (SEQ ID NO:404) 
Reverse Primer 

30 Complement of 51779-51758 GAATGCAGGGAAGAGAAGGCAG (SEQ ID NO:405) 

Exon 9 Fragment 1 
Forward Primer 

53017-53038 GCCATCAGGACATGGTGATITC (SEQ ID NO:406) 
35 Reverse Primer 

Complement of 53539^53518 TGAGCACTCGTACITCCCGAAG (SEQ ID NO:407) 

Exon 9 Fragment 2 
Forward Primer 

40 53378-53399 TGAGAGCAGCAGGGATGACTTC (SEQ ID NO:408) 
Reverse Primer 

Complement of 53948-53926 AAACTCCTGATAGCCACTGGTGG (SEQ ID NO:409) 

Exon 9 Fragment 3 
45 Forward Primer 

53869-53868 AGATCCTCCGCCGAAATGTC (SEQ ID NO:410) 
Reverse Primer 

Complement of 54583-54560 TTACTCTTCTCTGAG ATGCCCGAG (SEQ ID NO:41 1) 

50 Exon 9 Fragment 4 
Forward Primer 

54334-54355 TGGGCAGTGGCATTGTCTACTC (SEQ ID NO:412) 
Reverse Primer 

Complement of 54769-54750 TTCCAGGAGGTGGCATTTCC (SEQ ID NO:413) 

55 
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^>equracing reactions were pertonned using tbe Big-Oye terminator kit Irom ttL Biosystrais (Jh'oster City^ 
CA) according to the manu&cturer's instructions. The sequencing products were analyzed on an ABI 477 
automated sequencer (P£ Biosystems, Foster City, CA). 

5 Analysis of Sequences for Polvmorphic Sites 

Sequences were analyzed for the presence of polymorphisms using the Polyphred program • 

(Nickerson et al., 14 Nucleic Acids Res. 2745-2751, 1997).. The presence of a polymorphism was 

confirmed on both the strands. The polymorphisms and their locations in the IL4Ra gene are listed in 

Table 3 below. 
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Table 3. Polymorphic Sites Identified in the IL4Ra Gene 



Pdymorphic 
Site Number 


Nucleotide Position in 
GenBank Accession 


Nucleotide Position in 
Rguret 


Reference 
Allele 


Variant 
Allele 


Example 


PS1 


97137(Aoc^AC004525) 


32884 


A 


G . 


IB 


PS2 


971 18(Ax#AC004525) 


32903 


C 


T 


1A 


PS3 


97060(Acc#AC004525) 


32961 


G 


. T 


1A 


PS4 


96886(Acx#^VC004525) 


33135 


G 


C 


1A 


PSSf 


d4272(Acc«SAC004525) 


35749 


A 


G 


1A,1B 


PS6 


94258(Acc#AC004525) 


35763 


C 


- T 


1A1B 


PS7 


g4251(Acc#AC004525) 


35770 


G 


A 


1A 


PS8 


94204(Aa^fiAC004525) 


35817. 


T 


C 


1A 


PS9 


94116(Acc#AC004525) 


35905 


C 


T 


IB 


PS10 


94077(Acx:#AC004525) 


35944 


C 


T 


1A 


PS11 


94063(Acc^*AC004525) 


35958 


G 


A 


1B 


PS12 


92691(Acc#^C004525) 


37330 


G 


A 


1A 


PS13 


92548(Accft^\C004625) 


37473 


C 


T 


1A 


PS14 


92435(ACX3«5AC004525) 


37586 


C 


T 


1A 


PS15 


92430(AccttAC004525) 


37591 


G 


A 


1A 


PS16 


9241 7(Aoc#AC00452S) 


37604 


A 


T 


A A 

1A 


PS17 


<*V><>"n/ A .n^ruU-A C'OCX 

y2377(Acxw\C004525) 


37644 


C 


A 


A A 
1A 


PS18 


92343(Accfft^C004525) 


37678 


c 


T 


A A 

1A 




coot o^riuLTrr^VAA^H>uc^y 






A 
/A 


1A 


peon 




43703 


J 


c 


1A 


PS21 


77013(Acc#AC004525) 


53008 


A 


G 


1A 


PS22 


76922(Atx:ttAC004525) 


53099 


C 


T 


IB 


PS23 


76868(Acx:#AC004525) 


53153 


T 


0 


1A 






5341 


A 




1A 










T 






76516(AX#AIC(X)4525) 


53605 


"T* 

T 


C 


A A AO 

1A, IB 


PS27 


7o514{Acx»\jOu04525) 


53507 . 


C 


T 

T 


A A 

1A 


PS28 


7oo0o(ACX>ffi<\C004525) 


ooolo 


T 




A A 

1A 


PS29^ 


763(XXAcx3#AC004525) 


53721 


T 


C 


1A, IB 


PS30 


76106{AccftAC004525) 


53916 


G 


T 


1A 


PS31'* 


76080(Acc#AC0045;») 


53941 


A 


G 


1A. IB 


PS32 


76072(Acx!tt«\C004525) 


53949 


G 


A 


1A 


PS33 


f O7o4(A0Cffi^lUU04a<d!o} 






T 
1 


4 A 








1 




1A, ID 


PS35 


7541(XAccftAC004525) 


54611 


T 


c 


1A 


PS36 


75323(Acctt^C004625) 


54698 


T 


C 


1A, IB 


PS37 . 


75321(Acx;#AC004525) 


54700 


T 


c 


1A 


PS38 


76280(Arc#Aa)04525) 


54741 


C 


■ T 


1A 


PS39 


75241{Acc#«VC004525) 


54780 


c 


G 


1A 


PS40 


74938(Acc«Aa)04525) 


55083 


A 


G 


1A 


PS41 


74879(Acctt^\C004525) 


. 55142 


G 


A 


1A 


PS42^ 


74693(Acx^!mX)4525) 


55328 


G 


A 


1A 


PS^ 


.74591(ACX3#AC004525) 


55430 


C 


T 


1A 


PS44 


74482{AccMC004525) 


55539 


C 


T 


1A 


MS45 


74263(Ac(maX34525) 


bb/b8 


G 


A 


iA 



Previously lepoxted in the literatme 
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Example 2 

This example illustrates analysis of fhe IL4Ra polymoiphi^s identified in the Index 
Repositories for human genotypes and haplolypes for all polymorphic sites except PSl, PS9» PSl 1» PS21, 
PS22,andPS23. 

A sampling of different genotypes containing these polymorphisms that were observed in these 
reference populations are shown in Table 4 below, with the haplotype pair indicating the combination of 
haplotypes determined for the individual using the haplotype derivation protocol described below. In 
Table 4, homozygous positions are indicated by one liucleotide and heterozygous positions are indicated 
by two nucleotides. Missing nucleotides in any given genotype in Table 4 can typically be inferred based 
on linkage disequilibrium and/or Mendelian inh^tance. 
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The haplotype pairs shown in Table 4 were estimated from the unphased genotypes using an 
extension of Clark's algorithm (Clark, A.G. (1990) Mol Bio Evol 7, 1 1 1-122), as described in U.S. 
Provisional Patent Application filed April 19, 2000 and entitled "A Method and System for Determining 
5 Haplotypes firom a Collection of Polymorphisnis*'. In this method, haplotypes are assigned directly from 
individuals who are homozygous at all sites or het^zygous at no more than one of the variable sites. 
This list of haplotypes is thm used to deconvolute the unphased genotypes in the remaining (multiply 
heterozygous) individuals. 

By following this protocol, it was determined that the Index Repositories examined herein and, 
10 by extension, the general population contains the 53 human IL4Ra h^lotypes shown in Table 5 below. 
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In view of the above, it will be seen that the several advantages of the inv ntion aie achieved and 
other advantageous results attained 

As various changes could be made in the above methods and compositions without departing from 
5 the scope of the invention, it is intended that all matter contained in the above description and shown in 
the accompanying drawings shall be interpreted as illustrative and not in a limiting sense. 

All references cited in this specification, including patents and patent applications, are hmby 
incorporated in their entir^ by reference. The discussion of ref^ences herein is intended merely to 
summarize the assertions made by their authors and no admission is made that any reference constitutes 
10 prior art. AppUcants reserve the ri^ to challenge the accuracy and pertinency of the cited references. 
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What is Claimed is: 

1. An isolated polynucleotide comprising a nucleotide sequences lected fiom the group consisting of: 

(a) a first nucleotide sequence which is a polymoiphic variant of a reference sequence for Interleukin 
4 Rec^tor Alpha(IL4Ra) gene or a fragment thereof, wherein the reference sequence comprises 
SEQ ID NO: 1 > and the polymoq)hic variant comprises at least one polymorphism selected from the 
group consisting of guanine at PSl, thymine at PS2, thymine at PS3, cytosine at PS4, thymine at 
PS6, adenine at PS7, cytosine at PS8, thymine at PS9, thymine at PSIO, adenine at PS 1 1. adenine 
at PS12, thymine at PS13, thymine at PS14, adenine at PS15, thymine at PS16, adenine at PS17, 
thymine at PS18, adenine at PS19, cytosine at PS20, cytosine at PS21, thymine at PS22, cytosine at 
PS23, thymine at PS25, thymine at PS27, cytosine at PS28, thymine at PS30, adenine at PS32, 
thymine at PS33, guanine at PS34, cytosine at PS35» cytosine at PS36, cytosine at PS37, thymine at 
PS38, guanine at PS39, guanine at PS40, adenine at PS41, thymine at PS44, and adenine at PS45; 
and 

(b) a second nucleotide sequence which is complementary to the first nucleotide sequence. 

2. The isolated polynucleotide of claim 1 which comprises an IL4Ra isogene. 

3. The isolated polynucleotide of claim 1 which is a DNA molecule and comprises both the first and 
second nucleotide sequences and further comprises expression regulatory elements operably linked 
to the first nucleotide sequence. 

4. A recombinant organism transformed or transfected with the isolated polynucleotide of claim 1, 

wherein the organism expresses an IL4Ra proteia encoded by the first nucleotide sequence. 

. The recombinant organism of claim 4 which is a nonhuman transgenic animal. 

i. The isolated polynucleotide of claim 1, wherein the first nucleotide sequence is a polymorphic variant 
of a fragment of the IL4Ra gene, tiie fragment comprising one or more polymorphisms selected 
fix>m the group consisting of guanine at PSl, thymine at PS2, thymine at PS3, cytosine at PS4, 
thymine at PS6, adenine at PS7, cytosine at PS8, thymine at PS9, thymine at PSIO, adenine at PSl 1, 
adenine at PS12, thymine at PS13, thymine at PS14, adenine at PSIS, tiiymine at PS16, adenine at 
PS 17, thymine at PS 18, adenine at PS 19, cytosine at PS20, (^osine at PS21, thymine at PS22, 
cytosine at PS23, thymine at PS25, thymine at PS27, cytosine at PS28, thymine at PS30, adenine at 
PS32, thymine at PS33, guanine at PS34, cytosine at PS35, cytosine at PS36, cytosine at PS37, 
thymine at PS38, guanine at PS39, guanine at PS40, adenine at PS41, thymine at PS44, and adenine 
atPS45. 

An isolated polynucleotide comprising a nucleotide sequence which is a polymorphic variant of a 

reference sequence for the IL4Ra cDNA or a finagment tiiereof, wherein the reference sequence 

comprises SEQ ID NO:2 and the polymoiphic variant comprises at least one polymorphism selected 

from the group consisting of thymine at a position corresponding to nucleotide 237, adenine at a 

position corresponding to nucleotide 244, cytosine at a position corresponding to nucleotide 291, 

thymine at a position corresponding to nucleotide 501, adenine at a position correspnndinff to 
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nucleotiae ^^4, C3^sme at a position conresponding to nucleotide 939, tbymine at a position 
corresponding to nucleotide 1242, thymine at a position corresponding to nucleotide 1293, cytosine 
at a position corresponding to nucleotide 1299, thymine at a position corresponding to nucleotide 
1701, adenine at a position corresponding to nucleotide 1735, thymine at a position corresponding to 
nucleotide 2023, guanine at a position corresponding to nucleotide 2254 and cytosine at a position 
corresponding to nucleotide 2397. 

8. A recombinant organism transformed or transfected with the isolated polynucleotide of claim 7, 
wherein the organism expresses a Interleukin 4 Receptor Alpha(IL4Ra) protein encoded by the 
polymorphic variant sequence. 

9. The recombinant organism of claim 8 which is a nonhuman transgenic animal. 

10. An isolated polypeptide comprising an amino acid sequence which is a polymorphic variant of a 
referrace sequence for the IL4Ra protein or a fragment thereof, wherein the reference sequence 
comprises SEQ ID NO: 3 and the polymorphic variant comprises one or more, variant amino acids 
selected from the group consisting of threonine at a position corresponding to amino acid 82, 

' histidine at a position corresponding to amino acid 1 85, isoleucine at a position corresponding to 
amino acid 579, serine at a position corresponding to amino acid 675, and alanine at a position 
corresponding to amino acid 752. 

11. An isolated antibody specific for and immunoreactive with the isolated polypeptide of claim 10. 

12. A method for screening for drugs targeting the isolated polypeptide of claim 10 which comprises 
contacting the IL4Ra polymorphic variant with a candidate agent and assaying for binding activity. 

1 3. A composition comprising at least one genotyping oligonucleotide for detecting a polymorphism in 
the Interieuldn 4 Receptor Alplia(IL4Ra) g^e at a polymorphic site selected from PSl, PS2, PS3, 
PS4, PS6, PS7, PS8, PS9, PSIO, PSll, PS12, PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, 
PS21, PS22, PS23, PS25, PS27, PS28, PS30, PS32, PS33, PS34, PS35, PS36, PS37, PS38, PS39, 
PS40, PS41, PS44, and PS45. 

1 4. The composition of claim 1 3, wherein the genotyping oligonucleotide is an allele-specific 
oligonucleotide that specifically hybridizes to an allele of the IL4Ra gene at a region containing the 
polymorphic site. 

15. The composition of claim 14, wherein the allele-specific oligonucleotide comprises a nucleotide 
sequence selected from the group consisting of of SEQ ID NOS:4-79, the complements of SEQ ID 
NOS: 4-79. and SEQ ID NOS:80-231. 

1 6. The composition of claim 1 3, wherein the genotyping oligonucleotide is a primer>extension 
oligonucleotide. 

17. A method for genotyping the Interieuldn 4 Recq>tor Alpha(IL4Ra) gene of an individual, 
comprising determining for the two copies of the IL4Ra *gene present in the individual the identity of 
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the nucleotide pair at one or more polymoiphic sites (PS) selected &om FS1» FS2, PS3, PS4, PS6, 
PS7, PS8, PS9, PSIO, PSl 1, PS12, PS13, PS14, PS15. PS16, PS17, PS18, PS19. PS20, PS21, PS22, 
PS23, PS25, PS27. PS28, PS30, PS32, PS33, PS34, PS35, PS36. PS37, PS38, PS39. PS40, PS41, 
PS44,andPS45, 

18. The method of claim 1 7, wherein the determining step comprises: 

(a) isolating from the individual a nucleic acid mixture comprising both copies of the IL4Ra gene, 
or a fragment thereof, that are present in the individual; 

(b) amplifying from the nucleic acid mixture a target region containing at least one of the 
polymorphic sites; 

(c) hybridizing a primer extension oligonucleotide to one allele of the amplified target region; 

(d) performing a nucleic acid template-dependent, primer extension reaction on the hybridized 
genotyping oligonucleotide in the presence of at least two diffeient terminators of the reaction, 
wherein said terminators are complementary io the alternative nucleotides present at the 
polymorphic site; and 

' (e) detecting the presence and identity of the terminator in the extended genotyping 
oligonucleotide. 

19. A method for haplotyping the Interleukin 4 Receptor Alpha(IL4Ra) gene of an individual which 
comprises determining, for one copy of the IL4Ra gene present in the individual, the identity of the 
nucleotide at one or more polymorphic sites (PS) selected from PSl , PS2, PS3, PS4, PS6, PS7, PS8, 
PS9, PSIO, PSl 1, PS12, PS13, PS14, PS15, PS16, PSl 7, PS18, PS19, PS20, PS21, PS22, PS23, 
PS25, PS27, PS28, PS30, PS32, PS33, PS34, PS35, PS36, PS37, PS38, PS39, PS40, PS41, PS44, 
andPS45. 

20. The method of claim 19, wherein the determining step comprises 

(a) ' isolating from the individual a nucleic add molecule containing only one of the two copies of 

the U^Ra gene, or a fragment thereof, that is present in the individual; 

(b) amplifying from the nucleic acid molecule a target region containing at least one of the 
polymorphic sites; 

(c) hybridizing a primer extension oligonucleotide to one allele of the amplified target region; . 

(d) performing a nucleic acid template-dependent, primer extension reaction on the hybridized 
genotyping oligonucleotide in the presence of at least two different terminators of the reaction, 
wherein said terminators are complementary to the alternative nucleotides present at the 
polymorphic site; and 

(e) detecting the presence and identity of the temunator in the extended genotyping 
oligonucleotide. 

21 . A method for predicting a haplotype pair for the Interleukin 4 Receptor Alpfaa(IL4Ra) gene of an 
individual comprising: 
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(a) identifying an ILAKa graotype for the individual at two or more of polymorphic sites selected 
firom PSl, PS2, PS3, PS4, PS6, PS7» PS8, PS9, PSIO. PSll, PS12, PS13, PS14, PS15, PS16, 
PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS25, PS27, PS28, PS30, PS32, PS33, PS34, 
PS35. PS36, PS37, PS38. PS39, PS40, PS41, PS44, and PS45; 

(b) enumerating all possible haplotype pairs which are consistent with the genotype; 

(c) accessing data containing the IL4Ra haplotype pairs determined in a reference population; and 

(d) assigning a haplotype pair to the individual that is consistent with the data. 

22. A method for identifying an association between a trait and at least one genotype or haplotype of the 
Interleukin 4 Receptor Alphagene which comprises comparing the frequency of the genotype or 
haplotype in a population exhibiting the trait with the frequency of the genotype or haplotype in a 
reference population, wherein the genotype or haplotype comprises a nucleotide pair or nucleotide 
located at one or more polymotphic sites selected firom PSl, PS2, PS3, PS4, PS6, PS7, PS8, PS9, 
PSIO. PSl 1, PS12. PS13, PS14, PS15, PS16, PS17, PS18, PS19, PS20, PS21, PS22, PS23, PS25, 
PS27, PS28, PS30, PS32, PS33. PS34, PS35, PS36, PS37, PS38, PS39. PS40, PS41, PS44, and 
PS45, wherein a higher frequency of the genotype or haplotype in the trait population than in the 
reference population indicates the trait is associated with the genotype or haplotype. 

23 The method of claim 22, wherein the haplotype is selected from haplotype numbers 1-53 shown in 
Table 5. 

24. The method of claim 23, wherein the trait is a clinical response to a drug targeting IL4Ra . 

25. A computer system for storing and analyzing polymorphism data for the Interleukin 4 Receptor 
Alphagene, comprising: 

(a) a central processing unit (CPU); 

(b) a conmiunication interfiace; 

(c) a display device; 

(d) an input device; and 

(e) a database containing the polymorphism data; 

wherein the polymorphism data comprises the genotypes and haplotype pairs shown in Table 4 and 
the haplotypes shown in Table 5. 

26. A genome anthology for the Interleukin 4 Receptor Alpha(IL4Ra) gene which comprises IL4Ra 
isogenes defined by haplotypes 1-53 shown in Table 5, 
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POIiYMORPHXSMS IN THE II.4Ra GEME 



TGTGAGCTAC TGTGTCTGGC CTGAATAATA AAATTTAAAA CAATTXiXUA 
AAAATTCACC ATGAGGTCTC ACTATATTCC CTAGGCTGGT CTCAAACCCC ■ 30100 
TGGACTCCAA GTGATCCACC CCACCTTCCC GAGTAGCTGG GACTAGAGAT . 
GCACACCATT GCACCCAATA GAGCAATACG TTTCTGTTCT TTGTAAATTA 30200 
CCTGCTCTAA GGTATTTTTG TTATAGCAGC CTATATGGAC TAAGCTGACT 
TGTAACGTTA CTTGAGACTT TAAAGTGTTC CGGTCACTGT TGGAGGGCTC 30300 
TGTCTGTGTT AGCTCATTTA ATCCCCACAA CACCTCAATC AGATGGGGCT 
ATTCTTAGTC CCACTTTATA GATAAGGAAA CTGAGGCATG GAAGCACAGC .30400 
TTGCTCAAGG TTCACATCTA GTCAGTGACA GAGCAGGTAT TTAAACCTCA. 
GGAAATAATC AGAGAAACAT GTGTAGAGGG TTGTCCAAGG AAGGCCACAT 30500 
CCAGAAGCAT CTCCCAGGAC AGTTGTTGTG TAGCTCACCC TCTGGACTTT 
GTGGGTGTGG GTGTTGTTTC ATGATTATAG AGAGAGCTCT GTGAACGTGG 30600 
AGGACCTGTT GTCGGCAGAG ACACAAATGG CCAGGGCATG GCTGGGCAGC 
CGCAGTGGCT CAGGCCTGTA ATCCCAGCAC TTCGAGAAGA CCAGAGGGGC 30700 
AGATCATGAG GTCAGAAGTT CAAGACCAGC CTGGCCAACA TGGTGAAACC 
CCGTCTCTAC TAAAAATACA AAAATTAGCC AGGTGTGGTG GTGGGCACCT 30800 
GTAATCCCAG CTACTCGGGA GGCTGAGGCA GAAGAATCGC TTGAACCCGG 
GAGGTGGAGG TTGCAGTGAG CTGAGATTGC ACCACTGCAC TCCAGCCTTG 30900 

. G 

GAGACAGAGC GAGACTCTGT CTCGGAAAAA CAAACAAACA' AGCAAACAAA 
CAAACAAATA AATGGCCAGG GCAGGGGAGG GTTGCATATT GAATAAGATG 31000 
AGCTCTGCTG. GAAGCACAGG TCAGCACTAA CCTGCTTCCT CTCTCTCTGC 
AGGTGCCTTG GCATCTCCCA ATGGGGTGGC TTTGCTCTGG GCTCCTGTTC 31100 

[exon 3: 31071. . 
CCTGTGAGCT GCCTGGTCCT GCTGCAGGTG GCAAGCTCTG GTAAGTCACC 
..31140] 

ACTTCTCAAT CATTCATTTG TTGGCTATTA ATGGCGTGCC AGGGTCCTGC 31200 
AGTATGTCAC CTGGCCTTAT GGAGATTACA CTGCAGTGGG AGGGGACAGC 
CAATGAC/^G TGGCCCTGAT TATCAGTAAA TTCTAAAGAT TGTTAGAAAG 31300 
TGATGGGAGC CGGGTGCAGT QGCTCACACC TGTAATCCCA GCACTTCAGG 
AGGCCGAGGC AGGAGGATCG CTTGAGCCCA qGAGTTCGAG GTCAGCTTGG ' 31400 
GCAACATAGG GAGAC.CTTGT CTCTACAAAT ^[aTAAAATAT TAGCCAGGTG 
TGGCAGTGCA CGCCTGTAGC CCCAGCTACT CAGGAGGCCG AGGTGGGAGG 31500 
ATCCCTTGAA CTGAGGAGGT CAAGGCTGCA GTGAACTGTG ATCGCGCCAC 
TCCACTCCAG CCTGCGTGAG AAAGTGAGAC CCTGTCAAAA AAAAAGAGAA 31600 
GGTGATGGGG AAAGT^CACA GAACAGCATA AGAGGGGGTT GGGGAAGCTG 
GGTGGAGTGG GGGGGATTGC AGTTGAAAGT AGGGAAGTCA GGGAAGGCCT 31700 
CATTGAGCTG ACTTGGAGGA AGCGGGAACC GTGCAGATGT CTGGGGAAGG 
CTCATTCTTG GCAGAGAGGC CCTGCACTGA GCCTGGCGGG AGGGTTGAGC 31800 
ACAGGAGGGA ATGTGGTGGA GGAGAGTGAG CAGCAGGAGG GAGCAGTGAA ' 
GGTCAGCAAG GTGACAGAGT GGCTGAATCA AAA7VAGACCT TGCAGTGTTT 31900 
GAGCAGAGGA TCCATATCAT CCATTATGTT CCAAAGGACT CTTCAGGATG 
CCGTGTGGAG AAAGGAAGAG GGTGGT^GCC AGGAGGTCTG GAGGGAGGTC 32000 
TGGAGTGGAG GAGATGAGAG GCTCCGGATC CCTCTGGGAG GTAGATTTGA 
•GGACAGATTG GAATTGAGGT GAAAGACAGA GAAAGAGAAG TGGCCAGGAT 32100 
GACTCCAAGA TTTCTGACCT AAACTACTGG GAAGGACGCG GTTGTCATTT 
CTGAAATGCA GAAGGATGCC AGAAGAGAAG GTACTTTGGG GAGGGGCGGG 32200 
AATCAGGAGT TAGTTTTGGA CATGAGATAA GCTTGGAATA TTTATTTGCT 
ATCTAAGACA GCTCCTTAAC ATGGTAAGCC CTTATGCAAG TTGTTGTCAG 32300 
CTGAGATGGG CGTGGCACTG AGCATGGGAG CATGGAGGCG CCTGAGTGGT 
CTCATGCTCA GGTGGTTTAG CAAACTCAGT GTACATCCTG CCAATTCCAG 32400 
TCCTGCCATG GCCACTGACA AGCTAGGAGG GCGCTGAAAG GAGAAGGACC 
CCGATGTCTC CTCCAGCCCA TCCATCTCCT CTCTCCCATT GGCCAAACCC 32500 
AACCGGAAAC TAAAGGCCAA GGGTACCCGG TGATGAAGAC TGTGGTATCA 
GCCTCCTGAG CACAGAGAGG GCAGAAAGGG GTGGAGACAA AGAGGGGCGC 32600 

FIGURE lA 



1/15 



wo 01/04270 



PCT/USOO/19094 



AGATAGTGGG 
GCTTGGTGTG 
AATTTGGCCA 
GGTTCGTTCT 
CCACAGTCAT 
CCCCAGCTCT 
TGCTATTCCC 
T 

TTCCTGGGCC 

CTTGCAGGAG 
AGTGGAAGAT 
TACCAGCTGG 

TTTGGGGAGG 

GGTGGTGCGC 

TGGTGTTAGA 

GTGGAGCGCA. 

GAATACTGGA 

TTTTCACACT 

CACATCATTC 

AATTTTTAAA 

AGGTCTCACT 

TCCTCCTGCC 

GTATCCGGCC 

GCATTCAACA 

CTAGAT^AATT 

GTCCCTGTAT. 

TTTGAGAACC 

TGGAGTCCTC 

GGTGGTTCCC 

TGCCCAGGAC 

GGGATATACG 

CATGATTATT 

CCAGGGATTG 

TTTTTGTAAA 

AGAAACTGCA 

TGTTTTCTTT 

TGGCACAATC 

CTCCTGTCTC 

ACCCGGCTAA 

GCCAAGCTGG 

TCCCAAAGTG 

TTTATCTTAA 

CTCACTTATA 

TGTACCATGG 

GAAGGAAGAG 

GACCCACCAG 

CAGATCTGAG 

GTGGCTCGCG 

CACTTGAGGT 

ATCTCTACTA 

AATCCCAGCT- 

GGTGGAGGTT 

GACAGTGAGA 

ACTCGTACCC 

GAAATTGTGT 



UAAATGGGGA 
ATGGAATGGC 
AGAACAAGTT 
TGTCTAATGG 
CCCGACACTA 
GTGGGCGCTG 
CTTGGTCCTG 



AGTGGCACTT 
ACTCCTTAAA 
GTCAAGTTTG 
CCCCCGCACC 
GCTGGGAAGC 
GCCCTCAACT 
TTTGGGTGCA, 



CCCCTAGCTC 
CTGCTACATA 
TGTGAGATAA 
CATGGGTATT 
TCCAGCAGCC 
TTGCCTGCAC 
AGTCCCCCTC 



GCTCAGGCTG CTCCTGTGTC TCCCCAGGGA 
T ' [exon 4: 32988 . . 
CCCACCTGCG TCTCCGACTA CATGAGCATC 
GAATGGTCCC ACCAATTGCA GCACCGAGCT 
TTTTTCTGCT CTCCGAGTAA GCCTGCGCTG 

.,33126] C 
TTGTGCCCAA AGGGTTTGCC CCAAGAGTGA 
TGGAGTGCAG GATGCTGAGT ATGGTTTGCT 
GGGGAGGTCC CATCTCCAGG GACATGTTAT 
TGGTGGGAGT GTTGGTCCAC GTGGCACATG 
CTAGACCAGC AGTTCTCACA CTTTTTGGTC 
TAAAAATGAG TGAGGACCCA AAGGGCTTTG 
TATGTTTACC TAATTAGAAC TTGCAATGAA 
AAATTAAAAC AATTAAAAAT TTTTTTTCTT 
GTGTTGCCCA GGCTGCTCTC AAACTCCTGG 
TCCGCCTCCC AAAGTGCTGG GATTACAAGC 
CAAAATGGAG AAATTTTAAG TCCCAACAAC 
AATCTTCAGA TCAATTACAT GATCACAGGT 
CCACTGTACG CCAGTGAGAG AGAGTGAAAA 
TATGATGAAA AGAGTTTTAC CTGGTGGGCC 
ACTGGACTAG ACCCTTGATT GAGGAGTACG 
TGTGATGGTG GATGGACCAG GACACATGGC 
TGGGCTACTC CATGGTGCAC AGGATGCTTC 
ATT^TCACGT ACACAAGACA CACAGTTACG 
GCACACCAGC ATGCAGCGTT CACCAGTAAA 
CTAAGGTAGA TGGGCTGTGC TTTGTTTCCA 
GCTU^CTATG GCCCGTGAGC CAAATCCGGC 
TAAAGTTTTA TTGGT^CACA CTGGCTGCTG 
TGGCCCTCCT TTATGTTTTT TGTTTGTTTG 
GAGACAGAGT TTCGCTCTTG TTGCCCAGGC 
TCGGCTCACT GCAACCTCTG CCTCCCGGGT 
AGCCTCCCGA GTAGTTGGGA TTAATGGTGC 
TTTTTCGTAT TTTTAGTAGA GACCGGTTTT 
TCTCGAACTC CTG7UVCTCAG GTGATCCAGC 
CTGGGATTAC AGGCATGAGC CACTGAGCCC 
TTGAAATAAT TCAGAAATGG AAAGTCAAAT 
AGTAAGAGTT AAATAATGTG TACACATGGG 
AATAACAGAC ATTGAAGACT TGGGAGGGTG 
AAGTTACTTA ATGGGCATAG TGTACACCAT 
AACCCCAGAC TTCACCACTA GGCAGCATAT 
GCTTGCCATC AAAATTGCAC TTGTAAGGCC 
GCTGTAATCC CAGCCCTTTG GGAGGCCGAG 
CAGGAGTTCG AGACCGGCCT GGCCAACATG 
AAAATACAAC AATTAACTGG GT?GTAGTGGC 
ACTAGGGAGG CTGAGGCGGG AGAATTGCTT 
GCAGTGAGCC GAGATCACAT CACTGTACTC 
CTTTGTCTCA GGAAAAAAAA ACAAAAACAA 
CCTAAATTTA TACAAATAAC CAAAAAAAAA 
GGCCTTTGAA GTCCAAAATA TTAACTATCT 
HGUREIB 



GAGGGCAGAG 
TTTTCCCTTT 
AGGTGCACTT 
TCTTCAGCTT 
CTGGTCCTGG 
TGTGCTTTTG 
ACGCATTGAG 

ACATGAAGGT 

TCTACTTGCG 
CCGCCTGTTG 
GAGCTGGAGG 

GCTGGGTCCA 
GCTGTTTATA 
GTAAGATACA 
GATACGGCTG 
TCAGGACCCT 
GTGTAGGTAA 
GAAATGGTGT 
ACTGAAATGG 
GCTCCAGTGA 
GTGAGCCGCT 
ATGCAAGCCC 
CATGTAGCCT 
GGCAAATAAC 
CAGACCACAC 
GTGTTGAGAG 
ATAGGAGTCA 
GTTACACTGG 
GGGCAGACTG 
GGTGGTATTC 
TTGGCTTAGT 
CCACTGCTTG 
TAGTTGTAAC 
TTTGTTTGTT 
TGGAGTGCAG 
TCAAGCGATT 
CTGCCACCAC 
CATCATGTTG 
CGCCTCAGCG 
GGCCTCCTCC 
ACTGCATGTT 
CATTATTCCA 
GGAGAGGGGT 
TTGGGTGACG 
CCAGTGAGAA 
GGGCACTGTG 
GTGGGCAGAT 
GTGAAGCTCC 
GCACACCTGT 
GAGCCCAGGA 
TAGCCTGGGT 
AAAACATU^ 
AAAAAAAAAG 
GGCCTGTTAC 



32700 
32800 
32900 

33000 

33100 

33200 
33300 
33400 
33500 
33600 
33700 
33800 
33900 
34000 
34100 
34200 
34300 
34400 
34500 
34600 
34700 
34800 
34900 
35000 
35100 
35200 
35300 
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AGAAAAAGTT 
TTAAGGTGGA 
GAGCTTATGT 
GGGCTTTCAG 
GGTGTGAGGA 
AGAGGCAACC 
CCCAGATCTG 
GGTTCCTGGA 
AGGTGACCAG 

[exon 
CCCTGAGAAC 

ACGTGGTCAG 

CTGCTGTGGA 

AGTGCGGCAG 
T 

TGGGGTGGGC 
A 

GCCTGTATTT 
GTCCTCCCAC 
TTCAGCCCAG 
TGAGATGAGG 
GAACTGGGAA 
TTTGGATGGT 
CCCTGCAGGA 
AAACATGCGT 
TAAAAACCTG 
TAAAACAGAT 
GTACAGAGAT 
TTTGAGGCCG 
GCCGAGCCCA 
AACATGGTAA 
TGGTGGCAGG 
ATGGCTTGAA 
TGCCCTCCAG 
ATAAATAAAA 
GGCTCATACA 
CGCGGTGGCT 
AGATCACTTG 
CACTGCACTC 
GAAAAAAAGG 
ACATTATTGT 
GCTCCAGAAG 
GCTGTGGTTT 
TAGATCCTCA 

[exon 
GGAAACCTGA 
GAGCAACCCG 
CAGTCAACAT 



TGCAGACCCC 
ACATTGGAAT 
TGTTCTAAAT 
GAACAAGCTA 
GCATCCTTGG 
CTAGTCGGCT 
TCCTCACATC 
GGCATGTCCC 
CCTAACCCAG 



TGGCCTAGCC 
TATCTTACGA 
TAATTAGTGC 
AGTCCCAGGA 
TGCATGTGGT 
GCCCCAGCAC 
CGTGATCGGG 
GGACACAGCT 
CCCCTGTGTC 



CGTGAGATGT 
TGGCCA2W^CT 
CACCGGTTCT 
CCAGGGCCGG 
AAGAGGCTGT 
ACCCTGGCCG 
AAGCTGGAAG 
GTGGGGCCCA 
TGCAGAGCCC 



GGGTTGGCTG 
GTGCGATGCA 
TCCCTTTCAT 
CAGCTAGGCA 
GGCCAGCAAG 
CTCCCAAGCC 
AGTCTGATGC 
GCCAGCCTAC 
ACACGTGTAT 
G 



5: 35736.. 

AACGGAGGCG CGGGGTGCGT 

T A 
TGCGGATAAC TATACACTGG 
C 

AGGGCTCCTT CAAGCCCAGC 

. .35887] 
GGGTGGCTGG GTGTGTTCCC 

AGGGGAGGAG GTGGGGTCAT 



GTGCCACCTG CTCATGGATG 

ACCTGTGGGC TGGGCAGCAG 

GAGCATGGTG AGCAGGGCGG 

ACAGCTGCCT GGGCTGAGGG 
T 

AGCAACAGCA GGAGGAAGCC 



TTGACGTTTT 



TCCCAAATCT 
CTTTGAAACG 
CTGTTTCCAC 
TGTGCTTGCT 
CATTCCTTCC 
CTTGACCGTG 
ATCCCCCTTG 
TGTATTCCAG 
AGGTCTCATG 
GAATCAAAAT 
GTGGCCAAAG 
GGTGTGGTGG 
GCGGATCACC 
AACCCCGTCT 
CGCCTGTAAT 
CCCAGGAGAA 
CCTGGGCGAC 
ATAAAAAGAT 
CTTTGCCTTC 
CATGCCTGGA 
AGCCCTGGGC 
CAGCCTGGGC 
GTAATTAATA 
TATTATAATC 
TATGCTATGG 
GTAGGCCGGG 
CATAGAGGCC 

6: 37334., 
CAGTTCACAC 
TATCCCCCTG 
TTGGAGTG/y^ 

. .374 
TCTGTGACCT 



GATGGGATTC 
GAGCTGGTCG 
CCCTGAACTT 
GGAAGGCATG 
ATTCTGTGTC 
GAAGGCTGAC 
AGCTTGCTGG 
TGATGCATGA 
GATATGATTG 
AAAGAAAAAT 
GTGTGAAGGA 
TTCATGCCTA 
GGAGGTCAGG 
CTACTAAAAA 
CCCAGCTACT 
GGAGGTTGCA 
AGAGTGAGAT 
TTTTTAAAAG 
ACTGTAGAAA 
ATCCCAGCAC 
TATTGAGGCT 
AACAGAGTGG 
AACACTAAAG 
TTCTTTGCTA. 
AGGTTTTGTC 
GCTGGGCTGG 
GCTTCTCCCG 
A 

CAATGTCTCC 
ACAATTACCT 
AACGACCCGG 
T 

85] 

CTGGGGAACA 



CTGCCCCTGC 
CAGTAGACCA 
AAGTGCCCAG 
CCTGCTGCTG 
CACTGGTCAG 
CTTCTTCTGG 
GCTGTGGTGA 
TATGACATGC 
CTTCAAAGGA 
ACTCAGTAAA 
TGCAGCTGTA 
TAATCCCAGC 
AGTTCGAGAC 
TACAAAAAAT 
TGGGAGGCTG 
GTGAGCTTAG 
TACGTCTCAA 
GCTGAAGTTT 
GGTGGTTAGT 
TTTGGGAGCC 
GCAGTGAGCT 
GACCCTGTCT 
TTCTATGTAG 
TGGCTCTGAA 
GACCAAAAAT 
GTGATGGGGG 
CAGTGAAACC 



CTGGGCCTCA 
CCAAGCCCCC 
GAAGGCGTAT 
ATTGAAAACC 
CTGCTGCGGC 
TACCCGGAGT 
CAGGAGTTTA 
ATCACAGGAA 
GACC/^GTTT 
TCATCATAAA 
AAAGCTGAAG 
ACTTTGGGAG 
CAGGCTGGAC 
TAGTCTGGCA 
AGGTAGGAGA 
ATCATGCTAC 
AAAAATAAAA 
GGGTTACTTT 
AAAGACCAGG 
CAGCGCAGGC 
GGGATTGTGC 
CAAAAAAGAA 
AATTTTAGCA 
TCTGTGTGGT 
CTGGGTGGTG 
AGTCACTGCA 
CAGGGCCCCA 



GACACTCTGC TGCTGACCTG 
GTATAATCAT CTCACCTATG 
CAGATGTGAG TGGGCATGCT 



GGGTGGGTGA CCAGCAGAGG 



35400 
35500 
35600 
35700 

35800 

35900 

36000 

36100 
36200 
36300 
36400 
36500 
36600 
36700 
36800 
36900 
37000 
37100. 
37200 
37300 

37400 
37500 
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CCCAGTCCCT 

CAAATCCCAG 
T 

AAGTCCCTTT 

XATCTCTGCA 
CTGTGTTTCC 
AGCTCACTTG 
ACAGTGACCC 
TCCCCATACA 
GGCCACACCC 
GTTGTTCCCC 
TGGAGTTTTT 
AGGGTGGAAC 
GGGTTCTAGC 
GCACCTGCCA 
TTTCACTGCG 
ACTCGCCTTG 
CCTGGCCAGT 
GGGCGAGAGG 
CCACAGTGGG 
TCATTTCCTG 
TCCCATTGGA 
GCTGCAATCC 
ATATCCCATA 
TGACAAGAGC 
AGATGAGCTC 
TTCTCTTGAG 
CACACTGGTC 
CATCCCAGGG 
CGCCACCTTA 
GGCCAAGCCA 
GGCACTGGGC 
ACAGCAGAGG 
GCTAGAGAGT 
CCCAGGCCTG 
AGAGAAAGGT 
ATATTTCTAG 
TGTCCAGCTC 
GGCTCTTCCC 
ACATCCCAAA 
TAAGAGTCCT 
CCTCTGAACC 
CCCTGCTTGT 
GGCTCAGCTC 
TCCCCAGGGG 
CCAGCCAGGT 
GTCAGCATGC 
CACAGGCCTC 
TCTTGTGGGT 
AGAACAGAGC 
ATGCTCCCCT 
GCTTAGACTA 
CCTAAGCAGT 
CACCTGCATC 
TGGGATAAAT 



GGAGCCAGGA GCCTGGGAGG CAAGCCCTGG 

T 

GAGCTAGAGA CCTGGCTTCT CACCTGGCTC 



GCTTCCTGGC 

TGCCGCTCCT 
CAGTTTGGGG 
AATAACCCTA 
GTGTCAGTTC 
GCTGACTCGT 
CAGGCTTAGA 
ATGGAAGCTG 
.GTTGTTGTTT 
ACAGTGGCGT 
GATTCTCCTG 
CCGTGCCCAG 
TTGGCCAGGC 
GCCTCCCAAA 
GGAGTTCCTT 
ATTTACAGGA 
TGGCTGAGTC 
CCACCCATGT 
GGGACAATCT 
AATGCCTGCT 
ACCTCTTTGT 
ACCCGCCCAC 
CCCTGTTTGG 
TGCCCCAGGC 
TGTCATTTGT 
GCCCAGGCTG 
AGCTCCCATG 
GGGGTGTCGA 
CAGGTATGGG 
CTCAGAAAGC 
GGGGCAAGTT 
TGCAAGACCT 
ACCACCTTCA 
ATCTGCCTTC 
TGCGCCCTGC 
TCATGTAGTG 
GCCACAGTGG 
TVAGGCTGCTG 
AATCACTGGC 
CCAGGGAGGA 
CATCCAAACC 
AAACTGGGGA 
TTAAGACGTG 
CTGGGCTATG 
CAGATACCAG 
GGAACAGTGG 
TGAACATAAA 
CTGATGAATG 
ATGAGGGTTG 
GGGCACCTAA 
TGCCACAGCA 
TCCAGAAAGA 



CCCCCACCCC TCACATCAGA 
T 

CCTCTGTAAA GGTAGGGCTG 
GACACAAGTG ATCGTAGGTG 
TTATTGAAGA GAATAATACT 
CCTTTTGAGG CCAACGGGTT 
TTACTAATTC CTCTTAATGA 
CTTTCCCAAG AAAACAAGAT 
GTCCTGACAT TCCCTTCACA 
GTTTTGAGAC AGAGTCTCAC 
GATCTTGGCT CACTGCAACC 
CCTCAGCCTC CTGAGCAGCC 
CTAATTTTTG TATATTTAGT 
TGGTCTCAAA CTCCTGACCT 
GTGCTGGGAT TACAGGTGTG 
CTTAAGTACA TGTATTGACA 
AACTATCAGG TCAGTAATGG 
CCCCTATTTT TCTGCTGGTG 
TTCCCCACCC TGAATCCACC 
CTGGACATAT GGGACCTGGG 
GTGCCACTCG CCAGCTGTGT 
GCCTCAGTTT CCTCATCTGT, 
AGGGCTATGA CAGTACAAGG 
CCCACATGTG TCCTAAAAGC 
CACAGAGATC CCCATCTGCC 
TCCTTGAGGT TTGTGAGGGC 
GGCCTGGTTG GCTCTCAGGG 
CTGGTGTCTG TCACTGCTTC 
TTTATATCTC TCAGGTCTGG 
GAAAGAGCAG GAATGGGGCA 
GGGGGGCATG GGGGGT^GGA 
TTGTTTGGTC AATAAATCTC 
ACAGAGAGTC CCAAGGATGG 
GAGTCCAAAG ATATGTTATT 
AGGCATGGCT GGATCCAGCT 
TTTATTCTGT ATTGGCTTCG 
GCAGATATGC CTACTAGTTG 
GAAAAGGGTT TTTTTTCTTG 
CTCAGTGGCC TGGCTTCGAT 
TGTGGGTGGA GAGAGGGTGC 
GTCACATACC TGCCTCTAGG 
AGATGAACTG AAAATAAGGC 
AGAGGAAGCA GGACTGTGCT 
GCACCAGATG CCAGTCATGG 
GCTTAGGGGT ATGTTGGTGC 
GTCTGAGGCA GAAGAATGAA 
CAACTGAGAT ACCCCATCTC 
GAATTTAGTG ATTGGCCAGA 
ATAGGCCAGG TGATGGGATT 
GCCCTGGAGT TGCAGGCAGT 
ACCCGATGGC ATAAAAGCTG 
CTATAGGCAC CAACTGTGGC 
AACATTAGGA ACTTACTATA 

FIGURE ID 



GGCTGGATAG 
A 

TGCCCTAGGC 
A 

GAAGGGGAGT 

TGGGCCACAT. 

GCACATTGAC 

GACTCAAGAG 

AAGGAGGAAG 

AGAGAGCAGA 

CAGTTTGTTG 

GTAGTGTTGG 

TCTGTCACCC 

TCCGCCTCCT 

GGGACTACAG 

AGAGATGGGG 

CAGATGATCC 

AGCCACCGCA 

TCTTTAAAAA 

CAGGGGCCGT 

TGCAGGGAGG 

TTCCTCACAT 

GTCCCACAGG 

GATGTTGGGC 

AACACAGGAG 

TGTGTGATAC 

CATGTGCCCT 

CGCTGTCCCA 

CGGCTCTGTG 

AGCAGGCACC 

CTCTCAATCT 

TTTCCCCTTT 

GTTGGCTCAC 

GTGCACAGAT 

CTTCTCATGC 

GCTGGGGGGA 

TAATATTTTC 

TCTAGGAACC 

TTTTTAGGCA 

CTCCAGGCCT 

ACGGTTCTAA 

GCTGTGCCAG 

TGGTGGAGGG 

GCTGCAGGTG 

AGGAGTGGCT 

GGCTAAAATG 

GATTGGATTG 

TCAGGGATGC 

GTCCAGCTTC 

TCCCTTCCCA 

GCTTGGCCAC 

GGCACAATTG 

GGAGTTCTGT 

GGCGGGTGTC 

TCATACTGAG 

GAATTTTGGG 



3760U 

37700 

37800. 

37900 

38000 

38100 

38200 

38300 

38400 

38500 

38600 

38700 

38800 

38900 

39000 

39100 

39200 

39300 

39400 

39500 

39600 
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40000 

40100 
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GCTAGAGCTA CTCATTCATT CCCCTAGATA ATTTCTAGGC AAGGTTCCAT 40300 
AGTGGAGGGG GAGTTTTGGC TTGGGCATTG AAGGATGCAT AGGAGTTTTC 
TAGATGGGGA AAGAAGGGAA CGGTAGACCA GGCAGAGGGA ACTGCATGAT 40400 
AAAAGGTTTA TGGGTGTGAA AATTCATGGA- ATGTTTGAGG ATTATGGGGT 
TGGGGGATGT GGGAATATGT GTAGCGATAA AGCACCAAAC AAAGCCAAAA 40500 
GTTTAGTTAG AGCCCTGAAT GCCTGCCTCA TAATGGTTTC CATATTTTAT 
ATGCCTACTA TGTGCCAGGC ACATTGCTCA GGGTCACACA GCTGGAAATG . 40600 
GCAGGGCTGA GTTTTTGTTG TTGTTGTTGT TGTTGAGACA. GAGTCTCACT 
CTATCACCCA GGCTGGAATG CAGGGGCGTG ATCATGGCTC ACTGCATCCT 40700 
TGACTTCCTG GGATCAGGTG ATTCTCCCAC CTCTGCCTCC CAGGTAGCTG 
GGACTACAGG CACAGGCCAC CACGCCAGGC TAATTTTTTG TATTTTTAGT 40800 
AGCGACAGGG TCTCGCCATG TTGTCCGGGC TGGTCTGGAT CTCCTGGCTT 
CAAGTGATCC CCCTGGCTCA GCCTCCCAAG GTGCTGGGAT TACAGGCTTG 40900 
AGCCACCGCA TCCAGCCCAG ATCTGAGATT TGCACCCAGT ATTTGAACTC 
CCAAGCCTGT GCTCTTTTTC CTCCCATGGA CATTTCTCTC AGAGATGGTC . 41000 
TCCCAAACAC CTGTCCTTCT TGTTAAAAAA CAGACAAACC GCAAGTAGTT 
CTTTGGAAGC TCAGATTTCT CTTTTGTTTC TTAGTAAAAC ATTTCCCAGT 41100 
TCCCAGCTCC CTTCCAGGGT GTAAGATTTC TTCGGTAACT TACATCTAGC 
TGTTGCTTCT TGTTTGCTCA TGTTTAGAAA GAAAGACAAA AGAGAGTGAG 41200 
AATTTTCTCT CCCTTCCCCA GTCTCCCCAC AACTCACACC CCACCCTCAG 
CTCCCTCTGT AATAGGAAAA TCTCTGAACT CTCTGTAGTT GCTCCAGCAA 41300 
TCTTTTGGAA CTTTGCTTCT TTCTTGTGAA AAAACCTCCC CTTGGCTCAC 
TTTGCACCAG GTTTCCCCAA ATGTGCTTCC AACCACAAGC AGAAATGGAG 41400 
CTGCCAGTAA CCAGGAAGAA AC.TGCCGGGG GCTGAGGAAG AGGAGAGGGA . 
GGTGCATAGC CCTGGATCTC GCAGGGAGAG GGGTGACAGG ATGAGAACTC 41500 
AGGTTGCTCA CTTGCCATCA GGGTCAGTCA TGAATATAGC GTTCATGTAT 
CACTTTTTAA AGCTTTTTTG GAGGGTAAifVA GTAATAGTTA CACAATUkTAA 41600 
AAATACAAAT GGTACAZ^AAG GACTTAGAAT GGAAACATGT TTCTCTCCCG 
ACTCCAGCCT CCTGTTTTTC TTCCCAGAGA CTGACCACTG CTGTCTGTCT . 41700 
CTTGCCAGAA GGGAAAGGGA GGCAAGGTTA GGGCAGGCAG AGGGCATGTG 
CATCCTTTAG AGAGAGCTTA TGTCTATACA AGCAAATGTG TGTGTTCAGT 41800 
CATCGCTGTC TTAGTTTTCT ATTGCTGCAT AATAATGGTA CTACCAGCTT 
CACAGCTTTA AACAACACCG ATTTATTATC TCATAGTTTC TGTGGTTGGG 41900 
AGTCTGGACA TAGCTTAGCC AGGTTCTCTG CTTTAGAGTC TCGTGAGGCT 
ATAATCAAGG TGTGGGATGG GGCTGCAGTT TCATCTGAGG CTCAATTGGG 42000 
GAAGGGTCAC TTCTAAGCTC ATACAATATT GGTGACATTC AGTCCCTGGC 
AGGCTGTTGA ACTGAGAGCC TCAGTTTCGT GCTGGCTGTT GGTTGTAGTT 42100. 
AACCCTGAAT TCCTTCCCAT GTGCCCTTTG CAAAGCCATC AAGGCAGAGA 
GACTTGCCTA GCAAGTAGGA TATTACAGTC TTCTGTAATA TAATCACATC 42200 
CATGAAATCC TCTATATATC CCATCACCTT TACCATATTC TGTGGGTTAG 
AAACT^GTAG CAGGTCCTGC CCACACTCGA GAAGACCAGA TGACACAAAG 42300 
ATGTGATTCA AAGTGGGGAT CATCGGGGCC ATCTTAGGTT TGTCTGCAGT 
GATCACTGTG CCATCTCTCT CTCTCTCTTT TTTTTTTTTT TTTTTCCGAG 42400 
ACGAAGTCGT CACTCTGTCA CCCAGGCTGG AGTGCAGTGG CATGATCTCA 
GCTTACCACA ATCTCTGCCT CCCAGGTTCA AATGATTCTT CTGCCTCAGC 42500 
CTCCTGAGTA GCTGGGATTA CAGGTGCCCG CCACCACACC CAGCTAATTT 
TTGTATTTTT AGTAGAGACA GAGTTTCACC ATGTTGGCCA GGCTGGTCTT 42600 
GAACTCCTCA CCTCAAGTGA TCCACCCACT TCGGCCTCCC AAAGTGCTGG 
GATTACAGGC ATGAGCCACC ATGCCCAGCC CCATCTCTCT TTAAAAAACA - 42700 
AACAAACAAA CAAAAAACAT AAAAAGAAGC AGAGAACACA TACACATCTG 
CATCTTCCCT TGTTTACTTA ACAATAGATC TTGGAAGTCA CTTCTCAGTA 42800 
GAGGCTAGGT TGGGCAGAGC ATTGGATTCT AGGCCAGTGA GTTTGGACTT 
GACCATGGAG ACACTAGGAA GCCCATGAAG GACAGAGAGA GATGCCTCGA . 42900 
CCCTGCCAGT CCTTTAGAAA GATCACCCAG TGCTTTTTGT ATACCAAACC 
CTATTTGAAA TACTTACGTA TATTAACCCA TTTCCTTATC ACCACAACCC 43000 
TGCGGGAAGG GAGATAGGCA CTTTTATTAT CTTCATTTTG CAGATGAGGA 
CATTGAGGTC CAGAGAGGTT ATGTCACTTA CTTAAGGTCA CACAGCCAGG 43100 

FIGURE IE 
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AAGTGGTAGT 
CTCACGAAAA 
AATTAGTCGA 
GAGGTGGC7VA 
GGTGGCXCTT 
CACTGGTGCC 
ACCAGTTCAG 

[exon 
GCAGCCAGCA 
CTGGGCTCAG 
AGTGGCACAA 

CTCCATTCTT 
TTTTATATCA 
GCTGTGGTGT 
C 

CCCACAATTT 
7VACAAGCTAA 
GAAATCTGGG 
GGACAAAGAT 
GTGAAGCCTA 
AGCCTGGGCA 
TGGTGGCATG 
ATCACTTGAG 
TTTACTCCCA 
AATACAATGA 
TATTGAAATG 
TATGTGCTTC 
GCATAAATAC 
TACTGAAATT 
GGGACAGGAT 
TTTTTTAAAA 
GACAGGGTAT 
GTGATCTTCC 
CAGTGTTCCT 
TGAGAGGTTA 
AGACCCCTGA 
AGAATCCCTG 
AGGCGTGATT 
AGTGGGCCGC 
GACTACACAT 
CATCCCAGAA 
TAGAAATATT 
TGGGTTTTGT 
CTTCTGCCGT 
TGCCCTCAGG 
GGAAAGGCTG 
GAAGCAAACA 
CATTGGCCAA 
TTGCAAGGAA 
CGTGATTGGC 
AGGAAAAAGC 
AATGGGAGGA 
AACGTGGTAG 
CAGCCCTCTC 
GTGTCTCCTG 



AGGGACTCTT 
CTCAGAAAGG 
GGAGCCTGAC 
GCAATGGCCT 
AAACATGGTG 
CCTCGCCCCC 
AATCTATAAC 



ACCCTTGTTT 
TTAAACAACT- 
GCATGTTGCT 
GAACAGGACG 
GGGTCAGCTA 
GGCTGGTGCC 
GTGACCTACC 



7: 43406.. 

CCCTGAAGTC TGGGATTTCC 
TGCTATTVACA CCACCTGGAG 
CTGTGAGTAT C/^GAGGCCT 

..43562] 
CCCCTGTGGC CAGACACTTC 
TAGGATGCCT CTAATGGCAA 
ATCTGCCAGG TAGGCAGGCT 



TACAGATGAG 
TGCCT7VAGTA 
CTAGCCTGGT 
AACAACCAAA 
ACGACAGCAA 
CT/^CATCTC 
TAGAACCCTC 



TACAGGGCAC 
TGAGTGGAGC 
AAGCAATGGT 

CCCTGGCTGA 
TCCTGCCATT 
AGGCTGCAGT 



CCATGGCTTA ACACTATAGG AATATATTTC 
CGTGAATGTT GCTGGTTTGT AGGTGGTTTC 
GAGTGAGGTT CTTTCCATCT TGTGGTGCCA 
TCTTACCTAC TTTTGTGTCC TGGTTTCCTT 
TGGACCTCAT TTCAGAATAT TTXTAAATAC 
ATATAGTGAA ACCCCCATCT GTACAAAAAT 
CACCTGTAGT CCCAGGTACT GGGAAGGCTG 
CCCAGGAGTT TGAGGCTGCA GTGAGCCGTG 
CCTGGGTGAC AGAGCAAGAG CCCATCTCTA 
AAT7VAAATAA AATATiATAGA ACTACAGAGG 
CAGTTATAAA ACATTTAAAC ACATTTTTAA 
TTTATTAAGA TCTATAAATA . ATAAGTTCTA 
TGTAATTTCA AAGTAGATAA GCATAAATAA 
GTGATGTGAT ATGAGAATAG CTGTGAGTTT 
CACTGATGCT GTCATTACTG GGGTCTCTTC 
TTGTATTTTA TTTTATTTTT AAAATTTTAA 
CACTATGTTG CCCAGGCTGC TTTTGACCTC 
CATCTTGGCT TCCCAT^GTG CTGGGATTAC 
GGCCCCTTCC TCCATTCTTA ATGGAAGGAG 
GGGAAAGTAA AGATGTZ^TT TCTTTCCCAT 
ATTCTACCTG CAGCCATGTT GGTCCATCAA 
CTCTAGGGCC CCACCATTGT CTGTATCCAG 
ATGGAGATCA CATCTGCTTC TTGAAAGCAG 
ATCACTTCCT CTCAAATTCT ATTGGTGAAA 
AGCCACAAAG GAGGCTGGGA* ACTTTCTCAC 
ACAACTCTTT TCAGTGAGGT ATCCCACAGG 
GATTATCTCA CATAAAATGA AGTCTTACAA 
ACAGCAGCCA AGTGATATCT CTTCCCTTCT 
CCTTCACATG GTGGCATTGT ATCCTTAGAC 
TTGGCCGTTG CACACTGTCT TACATAAAGC 
CTACGAGAGA GTGTACCTTG TGCATCTCTT 
TCTTTCTAGA AGCTTCCCTA GCAAAATTCC 
GACTGTTACA TGTTACATGG TTACTGTTAT 
GACTGGGAAC TCAAATGCCT GGAAAAAGGA 
TCAAGCCTTA GGGTGGGCAT GGCTCCCTGA 
TGTTGAGTGA AGAAGACTGC TTCAGTTTCC 
GTAAGGGCTG TCGTGAAAAC TCAATGAAAG 
GTGCAGTGGC AGCTGGCAGT ACCCTGACCC 
AGCATTGCTC ATCCTGCACT GTGGATATCA 
CCCTGGGCTG TGAGCTCCAT AGGCAGGGTC 
nOURE IF 



ATTGAATTAT 
ACATACAGCT 
CACAGTTACA 
TACCCAGGCT 
CCAGGGTGGG 
CCTTTTCTCT 
CCTCCGCATC 
A 

GGGTGAGGGC 
CCCAGCACCA 
AATCTCCACT 

GTCTCTGGGC 
AGATACACCT 
AACACACAAG 

TTGCTCATGT . 

CCTCCCTGTA 

TCATTCTCCA 

TGGCAGGCTG 

ATAAAATCCC 

TAGCCAGGCA 

AGGTGGGAGG 

ATCGTACCAC 

AAAATAAATA 

AAACTAATTG 

TCTAGAGATA 

GGGGTAGCTC 

TACTTTATGA 

TGTTTTGCTG 

CCTCCATTCT 

AATAAATAGA 

CTGGGCTCCA 

AAGTGGGAGC . 

ATGCTAGGTG 

CCAAGTTCTC 

CCCCAAGTGA 

CCAGCAGAAG 

ACAGCCCGGA 

ATGGTCACAT 

TTGGAACCTA 

TCTTTCGCAG 

ATGGACCTAC 

GCTGTCTTCC 

TTGCCACCCA 

AGGAAGGAAA 

TTTTAATCAG 

CCTTACATCT 

TACTTGCTCA 

ACAGGATAAT 

CAAGGGAGAG 

CCATCTGTAT 

AAGATTCTTC 

TGCCACCGCA 

GTTGAGCCAC 

TCCATGGCTG 



43200 
43300 
43400 
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43600 

43700 
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44000 
44100 
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TATCTCCACJA ACCCAGCACA GAACCAGGTG CTTGGGAAAG TTTTGAA'i'TG ^bbOO 
ATTCTCATCT GCCATTGGCA TGGGGAAGGG T^CTAGCTTG TATGAAACAG 
ATAACAATGT ATGGGACCCT CATTCATTAT TTCAGCAAAT ATTTGCTGAG 45900 
TTCCTCCTAC ATGGCTAGCC CTG.TGCTAGA CACTGGGGAA TCGGCGATGA 
ACAAAGCAGA TAGAAATCCC CACTCTTGTG GAGCTGACAT TCTGGAGGGA 46000 
GAGACAAAAA GCAAACATAT AAAGAAAGAA AGAAATCACA TGGATCTGGA 
TGACAGTGAG TGCTGGGAAG AAAATAAAAG CAGAGGAAGG GGATGGAGCG 46100 
ATGGGCAGGG GGCAACGGTA GGGAGGGTGT CGGGGAAAAC TTTTTGGAGA 
ATGTGACGAT GAAAGTGAAC AAGGAGAAGT CAACCGTGTT GAGATGATGG 46200 
CAGCTAATGA TGTGGACAGG CCACTCTGTT CTGAGTGCAT TATCTATTGA 
TTCATCATGT CATCCTCGCA ACAGCCCTGC ACGATCAATT CTGTCATTAA ' 46300 
CCCCATAGTA CAGATGAGGA TGCGGAGGCA CAGAGAAGAT AAGGGACTTG 
TCCTGTGTCA CACAGCAAGG AGCCATCCGG CTCCTAAGTT GGTGCATTTG 46400 
ACTTCTGTGC TTCCGGT^G AAAGAGCAGC AAGTTTAAGA TCTGGAGGTG 
GCACTGAGCT TTGGAGGAGC AGGGGGCAAT GAGGTGGCCG GTGTGACGAG 46500 
GACTCAATGT GCAAGAGGGA GAGTGGTGGG GAGATGAGGT GGAGGGGTGG 
TCGGCGGTCA GATCGTGGAG GGTCTCGGAC GAGGGTCCTG ACCCTGGGTC 46600 
TCCAGTCCTG GGAAGTGGAG CCCAGGCTGT ACCATGGCTG ACCTCAGCTC 
ATGGCTTCCC CTCCCACTTC CAGCCTACAG GGAGCCCTTC GAGCAGCACC 46700 

[exon 8 : 46674.. . 
TCCTGCTGGG CGTCAGCGTT TCCTGCATTG TCATCCTGGC CGTCTGCCTG 
TTGTGCTATG TCAGCATCAC CAAGTGAGTC CTGGGCCCAG TGCTGCCGAG 46800. 
..46773} 

CAGTCCCTCT GGAGTGCAGG GTGGCAGGGA CTTGCCCCTC TAGTCTGCCC 
CTTTGCAGTC CTCTCAGTCA ATAATACGTA TTTACTGAGC AGCTACTACA 46900 
CACCTTGAGA GTAGAGCTGA GAACATATCG ACAAGGACCC CACTTTTTTC 
TTTTTTTCTT TTTTTTTTTT TTTTGAGACG GAGTCTCACT CTGTCACCCA 47000 
GGCTGGAGTA TAGTGGCACA ATCTTGCCTA ACAGTAACCT CCGCCTCCCG 
GGTTCi^AGCA ATTCTTCTGC CTCAGCCTCC AGAGTAGCTG GGATTACAGG 47100 * 

CGCATGCCAC TATGCCCGGC TAATTTTTTG TATTTTTGGT AGAGATGGGG 
TTTCACCATG TTGGTCAGGC TGGTCTCGAA CTCCTGACCT CATGATCTGC 47200 
CTGCCTCAGC CTCCCAAAGT GCTGGGATTA CAGGTGtGAG CCACTGCACC 
CAACCAGGAC TCCACATTTC TAAAACCGGC ATCCTACTGG GGAGACTGAA 47300 
AATACATATC AATCACAAAC AGGTGGXTTT CCATAGTGAC CCACTCTCTG 
AATGCACTAG ACCAGGGTGG AGGCCAGAGA TCTTCTGGGG TGCTTTTTGC 47400 
AAGGGGGACC AGGATAAGGC TCTCCAAGGA GGGAAAATTT GAGGGGGGCC 
CTGACTGGGG AGAATGAGCT GGCCAGGGAT AAGCAAGATG GAGTCATCCC 47500 
ACATCCCCTT ACAACACTGG GTGCCTGGGC . AACTGGGGGC ATTTGGGGGC 
ATGTGGTAGG AGCCAGAGGA ATTTGCGACG ATTQCCCTGA TGGAGTCAGG 47600 
AGACCTGGGT TTGAATCCTG GCCTTGGAGC TTGGTAGCTG GCGGCCGACA 
AGTTGCTGAA ACCCCTGAGC CTGGGGTTCC TGCTTTGCAG AGTGACAGTG 47700 
ATGGTGAGAA CATATTTCAT CAGCCAGAAG AGGCCAAATC ACAGTAAAGG 
CTGAGGGAGG AGATGAGTGG CGAGTGGCTG GGAGGTGGTG GAAGGAGCCT 47800 
CGTTTCCAGA GAGCTCTTGC CAGCCCTTGG AATCATGGTG TCTCAGAGGC 
TCAGTCCTCC CATCTCTGl^V ATGGGACTAG CAAGCTCAAC CTCACT2VAGT 47900 
CAGGATTAGA GGTGGCTAAG GATTATTAAC ATGATTGATG AAAGTGCCCA 
CXCTTGGCCC AGCACACACT AGGTAGGCAG GGAATGCAAA TTCCCCTCCA 48000 
TATCTTGTCA CTGATGCCTC CGAGCAACCT TGGACTGATC GCCTTGCTCT 
GAGCCTCAGT TTCCCCATCA CCTGTACCTC TTCCCACTCC CCATCACTAT 48100 
ATCCCAGCAT GCCAGCCTCT TTGCTGTTCT TTGTCTTTGG TTTCTTGTTT 
TGTTCTGTTT TTTAGACAGG GTCTCACTCT GTTAGCCAGG CTGAAGTGCA 48200 
GTGGCGCGGT TACGGCTCAC TGCAGCCTCC AATTCCTGGG CTAAAGAGAT 
CCTCCCATTT CAACTTCCAG AGCAGCTGGG ACAACAGGCG CTTGCCACCA 48300 
CACCTGGCTA ATTTTCTTAT TTTAATTTAA TTTTATTTTA TTTTTTGGGA 
CAGAGTGGAG TCTCAAAAAC CAAGCTAGAG TGCAGTGGTG CGATCTCGAC 48400 
TCACTGCAAT CTCTGCCTCC CGGGTTCAAG CGATTCTCCT GCCTTAGCCT 
CCCGACTAGC TGGGATTACA GGCGTGTGCC ACGACACCCA GCTAATTTTT 48500 
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GTATTTTTAG TAGAGATGGG GTTTCACCAT GTTGGCCAGG ATGGTCTTGA 
ACTCCTGACC TCAAGTGATC CACCCACCTC GTTCTCCCAA GGTGCTGGGT 48600 
ACAGGCATGA GCCACTGTGC CTGGCCAATT TTCTTACATT TTGTAGAGAC 
TGGCTGTCAC TTATGTAGCC CAGGCTGATC TTGAACTTCT ACCCCTTTAT 48700 
CTTTATTCAT GGCACTTATT ACCATGAATG AATGACCTCA TATAAGCATT 
TCTTTCGTTT tTTTTTTTTT TTCTTTGAGA TGGAGTCTCA TGTTGTCCCC • 48800 
CAGGCTGGAG TGCAGTGGCG CGATCTCAGC TCACTGCAAC CTCCGCCTTC 
CGGGTTCAAG CGATTCTCCT GCCTCAGCCT CCTGAGTAGC TGGGATTGCA 48900 
GGCGCCTGCC ACCATGCCTG GCTAAGTTTT GCATTTTTAG TAGAGACGGT 
GTTTCACCAT ATTGGCCAGG CTGGTCTCGA ACTTCTGACC TCAGGTGATA 49000 
CACCTGCCTT GGCCTCCCAA AGTGCTGGGA TTACAGGCGT GAGCCGCCAT 
GCCTGGCCTC ATATAAGCAT TTCTGTCTCC ATTTATCATC CATCTTTCCC 49100 
TCTTGAAGGT CAGTTTCACC AAGGCAGGCA TCTTTGTCTC GTTCACTGTT . 
GTAGCCTCAG GGCCAGGCAC AGTGAGTCAA ACATAGAAGG TGCTCAATAA 49200 
ATATGTGTTT ATTTATTGAA ACCATGGGCA GAGGCTAATT CAGAAGCGGT 
CTGAGGACCT TACCTCCCAG TGATGATGCA CCATGGCCCC AGGCAGGCCA 49300 
GGAAGAGAGA AGGGTTGTGT TTCTCCGTAG GTCGCCCAGC TTCCCAGGCC 
ATCCCAGGCC ATTCCCTGGT CATTTGCCCT CAGCTGCTCT GAAAAAGGGA 49400 
TTGTTGAGGG GT^CCTAGAA TCCTCTCTCT GCAGTTTGAG TCTTTCCTAA 
TCCCCTGGGG TCTCATTCCC ACTGAGGACA TAGGTGGCCT CCTCAGGAAC 49500 
TCTGTGCTGG GTAACAGAAT GCGGGAGTGT GAACCTGGCT CTGCCACCTA 
CCAGCTGTCA CTCCACCTCC TTGGGCCTCA CTCTCCTCAT CTGTAGAATA 49600 
GGGTTAGCAA TAGAATCCAT GTCACCAGGT TAGAATGATG AGTCAGTGGT 
TTGACCTCCA GAAACTAATC AGCCTGATCT CTGATGCCAA ATAAGTATTG 49700 
GTGATAACGA CCACTTTTAT GGGAGGAGCG TTCACCTGTC AATAATTCAG 
AGATCAACAC CTTTTCCTTT TGTTTTTCAG GATTAAGAAA GAATGGTGGG 49800 

[exon 9: 49781. . 
ATCAGATTCC CAACCCAGCC CGCAGCCGCC TCGTGGCTAT AATAATCCAG 
GATGCTCAGG TAGGAGTAGG CGTGGATGAG GACATGTGGG ACTGTGTACA 49900 
.•49859] 

TGAAGJJVGTG TGGTTCAGAA CACCTGGGCT GTTAAGGACC TTCACTGGCT 
TCTGGAATGG CAAATAGACA GTCAGGAGGG TTGCAGGGGA GACAGAGGCA 50000 
GAAGCCGAAT GAGGTCATTA GCAGACCAGA GGCTTTCCCG CCCTTCCCCT 
TGGCAATCCC AGCCTGGGGT GGGCTTCTCT GGGGTTGGTT TCCTGTTTTT 50100 
TTCCCTCCCC TTGGGAGAAT GACCCTTGGG TCATCATCAC TGTGTCATTC 
CCTGGGGAGG TGCCAGTACC AGGGCTAGAG GCCAGAAGGA GTGGAGGAAG 50200 
GAGAGGGTGA CAGGCTTTCT GTGTCTTCTT CTTAAGCATA GGAAACTGCC 
CCCGAAGCAC TAGCAAATCC CTTCCGGGTT CTCATTGGCC TG7VAATGTAT * 50300 
CCCACCCCTA AGCCAGGGGT GGAGTCAGCT TCCCCAAGGC GATGGTCCTG 
TGGGTGAGTG GGTGGGGTTT GCCTGAGCAA GATGAGAGTT CTCTAGGTAG 50400 
GAGAAAGGGG GATTATAGGT CCTGTCTAGA AGAGAAGGTC TGAGGGTCCT 
TGCTTTTCCA GGGACTCTGG AATCTAGTGT TGGCTTTGAA TCCTGACTCT 50500 
GCCACTCACT GGCAGTGTGG ACTTGAGCAA GTTGCTTAAT TCTCTGAGCC 
TCAGTTTCCT CTTGTGGGTT ATAACAGTGT TTACCTGGTA GGACAGATAT 50600 
TGGAATTTAT TGAGACAATA CATATAAAGT GCATATTCCA GCCTCTTGCA 
AATACCAAGT GCCATTTATG TATCAGTTAG TGTTTGCTGT GTAACAAATG 50700 
ACCCCGAAAT GTAGAGGGTT ACAACAACTT TATTTAGCTT ATGCTTCTGC 
AGGCTGGCAT TTGGGGCTGG GCTCAGCAGT GAGGGTGGCG GGGGAGGCTG 50800 
GGCTGGGCTG GGCTGGGCAG ATCTGAATTG AGCTGACCCG TCCCCGTAGC 
CTCCCTCCGT GTCTGACAGT TGGCTTTTTT TTTTTTTTTC TTTTTCTGAG 50900 
ACGGAGTTTT GCTCTTATTG CCCAGGAGTG CAATGGCATG ATCTTGGCTC 
ACTGCAACCT CTGCCTCCTG GGTTCAAGCA ATTTTCTTGC CTCAGCCTCC 51000 
CAAGTAGCTG GGATTACAGG CATGTGCCAC CACGCCAGGC TAATTTTGTA 
TTTTTAATAG AGATGGGGTT TCTTCATGTT GGTCAGGCTG GTCTGGAACT 51100 
CCTAATATCA GATGATCCAC CCACCTCAGC CTCCCAAAGT GCTGGGATTA 
CAGGCGTGAG CCACTGCACC CAGCCTAGTT GGCTGACTTT TACCTGGGAC 51200 
AGTGCAGGTG CCTGAGCCAT GTGCCTCTCA CTCTCCAGCA GGCCGGCCCA 
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GGCTTGTTTA CAGAGTGGCT CAGTTTTCAA GGGTGGGAAG TCCCAAGGCT ■ 51300 
TCTTGAGGCC TAGGCGCAGC ACTGGCATGA TATCACTTCC ATCACATTCT 
ATGGGCCCAA GCAAGTCCCA GGGCCAGTGT AGATTCAAGG GATGGGAGGA 514 00 

GATTCAGAGC ACTCCTCTGT GGCCACTTTT GCCATCGACC ACAGTCCCTG 
TAAATATTAG GACAATGTAA TTAATTCCCA GGAATCTGAG GCTCAGTU^AG 51500 
CGTAAQTGAC CTGTTGGACT TCTGATCTGT GTGATGTCGA GGCTTGTACC 
CCTTCCTGAG CATTGCCGTA CTCCAGGCCG GGCTGCAAGG CCACTCTGCT 51600 
CTTTCATTGG CTGTCTCTGT ATTTTAGGGG TCACAGTGGG AGAAGCGGTC 

[exon 10: 51628. . 
CCGAGGCCAG GAACCAGCCA AGTGCCCGTA TGTATCTGAA CTTAGGTCAC 51700 
. .51677] 

AGCCTGCATG CATTGGGAAG GTGATAGAAT TGGAGAGGCA AGCCCCTAGC 
TCCATGTCTG CCTTCTCTTC CCTGCATTCG GTAATTGCCC TGTGACATTA 51800 
GCCTTCAAGG GACGGCAGGA GGAGGGGTGT TCTGGAAACG TGGACTGCTG 
GCCAAGCCCC CTGAGTTTCA CTGGTGTGTC AGGTACATGG TGATACCCCT 51900 
TGGGAGTGCT GXTATAGXTA ACAACCAGAG CAGCCGTGCC TGTTGTTAAA 
ATCTTGACCT AATTGTATAC TTGTCGGCAA ATAGCCACTA TCCTGAACAC 52000 
TCCCCTCCTT TTTTTT/^^TA TACAGGATCT CACTCTGTGG CCCAGGCTGG - 
TGTGCAGTGG TGCGATCATA GCTCACTGCA CCTTCAAACT CCTGAGCTCA - 52100 

AGTGATCCTC CCATCTTAGC CTCCCGAGTA GCTGATACTA CAGATGTGCA 
TTACCACGCC TGGCTATTTT AAAAGGTTTT TGCCTGTAAT TCCAGCTACT 52200 
CAGGAGGCTG AGGCATGAGA ATCACTTGAA CCCGGGAGGC AGAGGTTGCA 
GTGAGCGCAG ATTGTGCCAC TGCACTCCAG CCTGGGCGAC AGAGTGAGAC 52300 
TCTTGTCTCA AAAAAAATAA TACCAAAAAA AGTTTTTGTA AAGACAAGCT 
CTCGCTGTGT TGCCCCGCCA CTGTGGCCTC CTTAGCTTCT TCCCTGGGGC 524 00 

CTGCTGGACC TTTCCATACT CCAGAAACTA AAGGGGGTCC AGGACCCTGC 
TTCAACCCTA GGATCCCGCA TCTTTTTTTT TTTTTTTTTT TTTTGGACGC 52500 
AGGGTCTTGC TGTGTCCCTC AGGCTGGAGT GCAGTGATTC ACTGCAGCCT 
CAAACTCGTG GGCTCAAGTG ATTCTCTAGC CTCAGCCTTC TAAGTAGCTG 52600 
GGACTACAGT CATACACCAA CATGCCCAGC TAATTTTCCT TTTTTTTAAT 
TCTTGTAGAG ATGTTTGAGA CGGCTTGGGC TCTGTTGCCC AGGCTGTTCT 52700 
CAAACTCGTG AGCTCAAGCG ATCCTCCCTC CTCAGCCTCC TAAAGTGCTG 
GGATTACAGG CGTGAGCCAC CGCACCCGGC TTCCATATCC TTTCTAATTG 52800 
GTCATGGCTT GGGATAATGG TGTTGCTTTT AATTATCATC ATCCATAAAG 
ACTTTTTCTT ACTCAACAGA TCTGAGCTTG TATTTGGTGC CCAGGACATG 52900 
TGCTGGGTTC CCGAAATCCC AAAGACACAG ACCCTACCCT CAGGGATTTC 
TCATTCTAGC AACATAGACT GATCAATTAC TGATTATAAC GTTAGAAGGC 53000 
ATGTCTGAAG TAGACAGCCA TCAGGACATCS GTGATTTCAG GCTGGGCTTT 
C 

GAAGAATGAA TAGGAGTTTT TCAAGTGTCG AAACTGAACC CTGACCAACC 53100 

T 

TTTGCTTTTG CAGACACTGG AAGAATTGTC TTACCAAGCT CTTGCCCTGT 
[exon 11: 53114. . 

TTTCTGGAGC ACAACATGAA AAGGGATGAA GATCCTCACA AGGCTGCCAA 53200 

c 

AGAGATGCCT TTCCAGGGCT CTGGAAAATC AGCATGGTGC CCAGTGGAGA 
TCAGCAAGAC AGTCCTCTGG CCAGAGAGCA TCAGCGTGGT GCGATGTGTG 53300 
GAGTTGTTTG AGGCCCCGGT GGAGTGTGAG GAGGAGGAGG AGGTAGAGGA 
AGAAAAAGGG AGCTTCTGTG CATCGCCTGA GAGCAGCAGG GATGACTTCC 53400 
AGGAGGGAAG GGAGGGCATT GTGGCCCGGC TAACAGAGAG CCTGTTCCTG 
C 

GACCTGCTCG GAGAGGAGAA TGGGGGCTTT TGCCAGCAGG ACATGGGGGA 53500 
T 

GTCATGCCTT CTTCCACCTT CGGGAAGTAC GAGTGCTCAC ATGCCCTGGG 
C T C 

ATGAGTTCCC AAGTGCAGGG CCCAAGGAGG CACCTCCCTG GGGCAAGGAG 53600 
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CAGCCTCTCC ACCTGGAGCC 
AGACAACCTG ACTTGCACAG 
CTTACCGCAG CTTCAGCAAC 



CTGGGTCCAG 
GATGCCCTGT 
AGCCAGAAAC 
GCAGCTGCAG 

ACATGCGGTG 
GTCCCCCAGG 
AGTGCTGTGT 
GGGGTATAAG 
CCCCAGTCCC 
CGCAGTCCGC 

TCTGGAGCCG 
AGGAGCAGGC 
TACTCAGCCC 
CCAGGAGGAT 
GCTGCTGTGG 

GACCCCTCTC 
CCTGGCACCC 
ATCCTGCCCC 

AACTTTGTCT 



ACCCACTGCT 
GTCCCCCAGG 
CTGGGAGCAG 
CCCCCGTCTC 
T 

GAGCAGGGTG 
AGAGGCTGGT 
CCCCAGAGAA 
CCTTTCCAAG 
TGTCCCCTTG 
AGAGCTCACA 

GGGGAAAAGG 
CACAGACCCC 
TTACCTGCCA 
GGTGGCCAGA 
AGACAGiSTCC 
G 

CAGGTGGGGT 
TCGGGCATCT 
TGGCAATGCT 
C 

CCGTGGGACC 



AAGTCCTCCT 
AGACGCCCCT 
TCCCTGAGCC 
C 

GGCCAGACAC 
TCTCTGAGCC 
ATCCTCCGCC 
GGCCCCCACC 

GCACCCAGGC 
TACAAGGCCT 
ATGTGGGTTT 
ACCTCATTCC 
TTCACCTTTG 
TCTCCCAAGC 

TAGAGGACAT 
CTTGTGGACA 
CCTGTGCGGC 
CCCCTGTCAT 
TCGCCCCCTA 



GCCAGCCCGA 
CGTCATCGCA 
AGTCACCGTG 

CTGGAGGAAG 
AACCACTGTG 
GAAATGTCCT 
AGTGGCTATC 

CAGTGCGGTG 
TCTCAAGCCT 
GGGGCTAGCA 
TGGCTGCCCT 
GACTGGACAG 
AGCTCCCCAG 
T 

GCCAAAGCCC 
GCCTGGGCAG 
CACCTGAAAC 
GGCCAGTCCT 
CAACCCCCCT 



TCCACTGGAG GCCAGTCTGT 
CAGAGAAGAG TAAATCCTCA 
CAGAGCTCAA GCCAGACCCC 

CACATACATG AGGGTCTCTT 



. -54692 

CCTCTTGTTG CTGAGTCTGC AGATGAGGAC TAGGGCTTAT 



GAAATGCCAC CTCCTGGAAG 



TGAAGAACCA 
CTGGGCTGCA 
TCCCATGAGA 
GCAGGAAAAC 
CTGCTCCCTC 
AGGCATGTTT 



TGGTATGAAG 
GAGACTGGAC 
GTAGAGGGCA 
TGAGGCCCTT 
CACAGCTTCT 
TGCCCACCAG 



GCAGCCAGGC 
G 

GTGATTGGCC 
CCCGCCCAGC 
CTGGGTCGCC 
GGGCACCTCG 
GCAGCAGACT 
ATCATGGCCC 



CTGTCTCACT GAACTAGAAG CCGAGCCTAG 



TGGCAGATTT 

CCACTGACGT 
ATTGGGCTGG 
GTGCCCCACG 
ACTTGTGAAC 
GTCCCTGTTG 
ACATGGAGGC 
G 

AAACTAACAC 



GGAATGACTT GGGCGGCCTT GGGT^TCGA TGAGAAATTG 
AGGGTGGTCA TTGCCTAGAG GTGCTCATTC ATTTAACAGA 
GTTGATGCTG GAGGCAGAAT CCCGGCTGTC AAGGGGTGTT 
GAGCAACAGA GGACATGAAA AATTGCTGTG ACTAAAGCAG 

A . 

CTGCCAAACA CCCATGCCCA GCTGTATGGC TGGGGGCTCC 
GGAACCCCCA GAATAAATAT GCTCAGCCAC CCTGTGGGCC 

T 

GACAGCAGGC ATAAGGCACC AGTTACCCTG CATGTTGGCC 
GTGCTAGGGA AGGCGGGAAC CTTGGGTTGA GTAATGCTCG 

T 

TTAGTTTCAT CACCTGTTAT CTGTGTTTGC TGAGGAGAGT 
GGGTGGAGTT TTGTATAAAT AAAGTTXCTT TGTCTCTTTA 
ATTAACCAAA CATACCTCCA GACACTGCTG TGAGTGCTGT 
ACTCCTGGAA TTCACCCATC CAGAGGAACC AGGATGC7UVG 

FIGURE IJ 



CCCAGAGTCC 
GGCAACCCTG 
TCCCAGAGAG 

TAGAACCCGA 

CCCCAACCTG 

CCAGCATGGG 

AGGAGTTTGT 

G A 

GTGGGCTTGG 

GCTTGCCAGC 

GTGGGGAAGA 

GGGGACCCTG 

GGAGCCACCT 

AGCACCTGGG 

CCACTTCCCC 
TGGCATTGTC 
AGTGTCATGG 
TGCTGTGGCT 
GAGGGCCCCA 

GTCCGGCCTC 
TCATCCTTCC 
CAAAATCGTG 

AGGTGCATGT 
C 0 • 

CCATGCCTGG 
T 

ccaaaagAct 

tggcctaaca 
gctcgccaca 
gcaggcccct 
gagttgttgg 
taactgccca 
ccacctgcct 

agccatcaag 

A 

AACTTCAGGG 
GCTTCCTTAG 
CAGTTAAGGG 
GGACAATTTG 

TCGTATGCAT 
GGGCAATCCA 

CAGACCTCAG 
TCTGTGTGTT 

GGAACAGAAG 
TTTTTTATGT 
GTCTCTGTTA 
AGGTTAAGAA 



53700 

53800 
53900 

54000 
54100 
54200 

54300 
54400 
54500 

54600 

54700 

54800 

54900 
55000 
55100 

55200 
55300 

55400 

55500 

55600 
55700 
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ACTTGCCGTC TGGGTTTGGG TTCCCCATAC 
A 

TAGGAAGTAA TCCCGGGAAA CCCTGCTAAG 
GG7VAGGACAC AAACCAAGAA AGTGTTACCT 
CCCC7\AAAGA GGGTTCTTGA ATCTCATGCA 
CCATAGAGTC AGTGAAAGCA AGTTAATGAG 
ATGGCTACTC CGTAGACAGA GCAGCCCTGA 
TTATGGTTAT TGATTAATTA TATTCCAAAC 
TCCCTTTTAG ACCATATAGG GTAACTTCCT 
AAACTGTCAT GGCGCTGTTG GGAGTGTAGC 
ACTCTTGTTG CCATCTTGGT TTTGGTGGGT 
CAACCTGTTT TATCAGCAAG GTCTTTATGA 
CTGTCTCATT CTATGACTAA GAATGCCCTA 
AGTAAGTCTC AGCCTCATTT TACCCAGCCC 
AATAAACCTC TGACAAAAGG GTGAGTTATT 
TAACTGATGC TTACCTGCCG GGGATCTCTG 
GCCCAGTTAT GCCTGCAAAG GAGAGGGAGC 
TCCCATCTGT CATTGGCTGA GAGCTGCTTC 
CACTTCCAGC TACTCCAGGA AAAAAAAAAT 
GTGTTGAGAG ACTCTGGCAC ACCAAGAAGA 
TGGCTGATGA TACACTGCCA AGGTCACACA 
TAGTGGAATC CAGACAGTGT CTCCATCACC 
GCGCTTCACA TCCGAGGCAG GCAGAGGGAT 
AGGCTGGGAA CCTGAAGCTC CTATGTCTGT 
TAGGTGCCCT GATTTCACAC TTGAGGGGCT 
CTGCTCTAGG AGCCTACATA CTACACTGGA 
CTCACATGCA GCCTGATGTT TGTTAGAAAC 
ATGGCTCATG GCTGTAATCC CAGCAATTTG 
CACTTGAGGT GAGGAGTTCA AGACCAGCCT 
ATCTCTACCA AAAAATAAAA AATTAGCCGG 
AATCTCAGCT ACTTGGGAGG CTGAGTTGGT 
CGCGGAGGTT GCAGTGAGCT GAGATTGTGC 
GACAGAGTGA GACCCTGTCT CAGGAAAAAA 
TTGTTCTAAG CCAAAATCAA TCCCTTTAGC 
CAGATGGAGA AACAGTTTTA GAGAGGAAAA 
CAGAGAATGG CAGAGCCTGA ACTAGCCTTC 
GCTCTTTATA ATAAAATATA ATTTTAAATA 
GCCAAGCAAT ATGCTAAGTG CCGTCCAGCC 
AAACAGCTCT AGTTGGGAGG CTC7VATGATT 
GAAACAGGTC CAGAGAGGTT GAGGATTAGC 
AATCCTGGAG CCAGGATTTG AACCCGGGTC 
GCCGTGATAT ATCAGCTTAT GTCATCCTGA 
CGAATCCACT TTGCCTGAGC ATTGTCTCAG 
AGGCAGCAAA TAG7\AAATAT ATTTGACTGC 
GAGCCCAACA AAGGATCTTA GGCAAAAGAA 
CAACTATTAG TGAACTGGCT GGGCTTTGGG 
GACCTGGTGG ATCAGGTGGG CCCAACTCAG 
AAGTAGGAAA AGTCCAGGGT CATAGGCCCA 
GTTTCAGCCT CCGGGGCTGG ACCAGAGGGC 
AGCAGCGCCA GAGTGGGCTG AGTGGCCTGG 
AGAGATGTTG ATTTGGGGGT ACTCCCTCAG 
TGTGGGGGAT GAGGGGAGGG GGAAAGGGGG 
CTGAATTTTT TTTTTTTTTT TACAAAAAGT 
TTACTCTATC AGCACGTGCA GACCTTTTCC 
ATATAAAGAG GAAAGTGAAG AAAAACCACC 
AGCATCTGGC ACTGTGTGGG CGATCACGAA 
GCGTAGTATC TCCGTGAACA TCCGGTTGAA 



PCTAJSOO/19094 
AAGGATTCAA ATAGTTGATT 55800 

GTAGTGGGGA ACTGAGGCAG 
GAAAGGGGTC CAGATGGAGA 55900 
AGAAAGAATT CAGAGCGAGT 
GAAAGTAAAG GAATAAAAGA 56000 
GGGTTGCTGG CTGCCTATTT 
AAGGGGTG6A TTATTATGGC 56100 
GATGTTGCCA TGGCATTTGT 
AGTGAGGACA ACCAGAGGTC 56200 
TAGAGCCATC TTCTTTACTG 
CTTGTATCGG. TGACGACCTC 56300 
ACCTCCCAGG AATGCAGCCC 
CTCTTCAAAG CTCCAGTTTA 56400 
CAACAGATTA CCAGCATGAG 
GAAGACCATG CATGGCACAT 56500 
TGGGGTATTT GTCCACCAGC 
CAGGAGCATT AATTCTCCAG 56600 
TCTTCAACTG AGAGTTGGAG 
CAGGT^CAGG ACACCAACAG 56700 
GCTAGTTAGC AACAGATCTA 
CAGGCTCTCT GTAGTGATCT 56800 
GGTGTGGGCC TTAGATGGGA 
ATCACTTTTG CTTCTCTGAG 56900 
TGGCCATTTT AGATTCCTTC 
AATGATGGGG AGCTCTCTAC 57000 
ACCTCCTTGC GCCAGGCATG 
GGAGGCTGAG GCGGGTGTAT 57100 
GGCCAATATG GTGAAACCCT 
GTGTGGTGGT GGGTGCCTGT 57200 
AGAATTGCTT CAACCTGGGA 
CATTGCACTC CAGCTTGGAT 57300 
AAAAAAAAAC AAAAAAAACC 
TGCCCAAATC ACACAGTTTA 57400 
GGGACTTGCC CAAAGTCACC 
TGGACTTCTT GCCTCCAAAA 57500 
AZ\AATAGTTA TCTGTTTAGG 
ACTGTGTCAT TTACGTCTCC 57600 
ATCCCAATTT TACAGATAAG 
CTAGAACCAC ACAGCTAGGA 57700 
TGACCTAAGA GCTCCCAGCC 
CACCTACGCA GATGTCGGCT 57800 
AGAAATCTAA TTTAAAAATT 
TAGAGATGCA ATGGGACTGG 57900 
ATCCAAGTTG TTGGCCTCAG 
AAGCTACAGA GGGATGAGAA 58000 
GCTGGCCCCC ACCCTGCAGG 
GTGAGATGCC GGCTGCGGGA 58100 
AGGAGGGGAC GCCCCTGGGT 
GCCCCTGCGG GGGAGCTTTC 58200 
CCCTGCCTTT ACACAGAATT 
GAGGAAGGCA GTGAGTGCAT 58300 
"GGCTTATTGC ATTTTTCTGA 
TATTCAGAGA AAGCCTGAAG 58400 
GGAAATCCCA TCCCCGCCCC 
ATGAGCGCTT GTTTTTGAAG 58500 
CAACCTTTCT GACTTTATTT 
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TTCCCACGAA AGTTATTAAT TAAAAAACAA AAAGCAAAAC ACCGAAAAAA 58 600 

CAAAiiAACCC AGCAAGTGTT TGAGCTCCCA CCACGAGGGA GGCCTGACGT 
CACTGGATCC TCCCGGCAGC CGATGAGGCT GCATGGGACT 58690 
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POtmORPHISMS ZM THE CODING SEQUENCE OF Ii:i4Ra 



ATGGGGTGGC TTTGCTCTGG GCTCCTGTTC CCTGTGAGCT GCCTGGTCCT 
GCTGCAGGTG GCAAGCTCTG GGAACATGAA GGTCTTGCAG GAGCCCACCT 100 
GCGTCTCCGA CTACATGAGC ATCTCTACTT GCGAGTGGAA GATGAATGGT 
CCCACCAATT GCAGCACCGA GCTCCGCCTG TTGTACCAGC TGGTTTTTCT 200 
GCTCTCCGAA GCCCACACGT GTATCCCTGA GAACAACGGA GGCGCGGGGT 

G T A 

GCGTGTGCCA CCTGCTCATG GATGACGTGG TCAGTGCGGA TAACTATACA 300 

. , C 

CTGGACCTGT GGGCTGGGCA GCAGCTGCTG TGGAAGGGCT CCTTCAAGCC 
CAGCGAGCAT GTGAAACCCA GGGCCCCAGG AAACCTGACA GTTCACACCA 400 
ATGTCTCCGA CACTCTGCTG CTGACCTGGA GCAACCCGTA TCCCCCTGAC 
AATTACCTGT ATAATCATCT CACCTATGCA GTCAACATTT GGAGTGAAAA . 500 
CGACCCGGCA GATTTCAG/^ TCTATAACGT GACCTACCTA GAACCCTCCC 
T 

TCCGCATCGC AGCCAGCACC CTGAAGTCTG GGATTTCCTA CAGGGCACGG 600 

A ^ 

GTGAGGGCCT GGGCTCAGTG CTATAACACC ACCTGGAGTG AGTGGAGCCC 
CAGCACCAAG TGGCACAACT CCTACAGGGA GCCCTTCGAG CAGCACCTCC 700 
TGCTGGGCGT CAGCGTTTCC TGCAXTGTCA TCCTGGCCGT CTGCCTGTTG 
TGCTATGTCA GCATCACCAA GATTAAGAAA GAATGGTGGG ATCAGATTCC 800 
CAACCCAGCC CGCAGCCGCC TCGTGGCTAT AATAATCCAG GATGCTCAGG 
GGTCACAGTG GGAGAAGCGG TCCCGAGGCC AGGAACCAGC CAAGTGCCCA 900 
CACTGGAAGA ATTGTCTTAC CAAGCTCTTG CCCTGTTTTC TGGAGCACAA 

C 

CATGAAAAGG GATGAAGATC CTCACAAGGC TGCCAAAGAG ATGCCTTTCC 1000 
AGGGCTCTGG AAAATCAGCA TGGTGCCCAG TGGAGATCAG CAAGACAGTC 
CTCTGGCCAG AGAGCATCAG CGTGGTGCGA TGTGTGGAGT TGTTTGAGGC 1100 
CCCGGTGGAG TGTGAGGAGG AGGAGGAGGT AGAGG2\AGAA AAAGGGAGCT 
TCTGTGCATC GCCTGAGAGC AGCAGGGATG ACTTCCAGGA GGGAAGGGAG 1200 

C 

GGCATTGTGG CCCGGCTAAC AGAGAGCCTG TTCCTGGACC TGCTCGGAGA 

T 

GGAGAATGGG GGCTTTTGCC AGCAGGACAT GGGGGAGTCA TGCCTTCTTC 1300 

C T C . . 

CAqCTTCGGG AAGTACGAGT GCTCACATGC CCTGGGATGA GTTCCCAAGT 
GCAGGGCCCA AGGAGGCACC TCCCTGGGGC AAGGAGCAGC CTCTCCACCT 1400 
GGAGCCAAGT CCTCCTGCCA GCCCGACCCA GAGTCCAGAC AACCTGACTT 
GCACAGAGAC GCCCCTCGTC ATCGCAGGCA ACCCTGCTTA CCGCAGCTTC 1500 
AGCAACTCCC TGAGCCAGTC ACCGTGTCCC AGAGAGCTGG GTCCAGACCC 
C 

ACTGCTGGCC AGACACCTGG AGGAAGTAGA ACCCGAGATG CCCTGTGTCC 1600 
CCCAGCTCTC TGAGCCAACC ACTGTGCCCC AACCTGAGCC AGAAACCTGG 
GAGCAGATCC TCCGCCGAAA TGTCCTCCAG CATGGGGCAG CTGCAGCCCC 1700 
CGTCTCGGCC CCCACCAGTG GCTATCAGGA GTTTGTACAT GCGGTGGAGC 
T G A 

AGGGTGGCAC CCAGGCCAGT GCGGTGGTGG GCTTGGGTCC CCCAGGAGAG 1800 
GCTGGTTACA AGGCCTTCTC AAGCCTGCTT GCCAGCAGTG CTGTGTCCCC 
AGAGAAATGT GGGTTTGGGG CTAGCAGTGG GGAAGAGGGG TATAAGCCTT 1900 
TCCAAGACCT CATTCCTGGC TGCCCTGGGG ACCCTGCCCC AGTCCCTGTC 
CCCTTGTTCA CCTTTGGACT GGACAGGGAG CCACCTCGCA GTCCGCAGAG 2000 
CTCACATCTC CCAAGCAGCT CCCCAGAGCA CCTGGGTCTG GAGCCGGGGG 

T 

AAAAGGTAGA GGACATGCCA AAGCCCCCAC TTCCCCAGGA GCAGGCCACA 2100 

FIGURE 2 A 
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GACCCCCTTG TGGACAGCCT GGGCAGTGGC 
CTGCCACCTG TGCGGCCACC TGAAACAGTG 
GCCAGACCCC TGTCATGGCC AGTCCTTGCT 
AGGTCCTCGC CCCCTACAAC CCCCCTGAGG 
G 

TGGGGTTCCA CTGGAGGCCA GTCTGTGTCC 
GCATCTCAGA GAAGAGTAAA TCCTCATCAT 

AATGCTCAGA GCTCAAGCCA GACCCCCAAA 
GGGACCCACA TACATGAGGG TCTCTT 



ATTGTCTACT CAGCCCTTAC 
TCATGGCCAG GAGGATGGTG 2200 
GTGGCTGCTG CTGTGGAGAC 
GCCCCAGACC CCTCTCCAGG 2300 

GGCCTCCCTG GCACCCTCGG 
CCTTCCATCC TGCCCCTGGC 2400 
C 

ATCGTGAACT TTGTCTCCGT 

2476 



nGURE2B 
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ISOFOBMS OF 



MGWLCSGLLF PVSCLVLLQV ASSGNMKVLQ 
PTNCSTELRL LYQLVFLLSE AHTCIPENNG 

V 

LDLWAGQQLL WKGSFKPSEH VKPRAPGNLT 
NYLYNHLTYA VNIWSENDPA DFRIYNVTYL 

VRAWAQCYNT TWSEWSPSTK WHNSYREPFE 
CYVSITKIKK EWWDQIPNPA RSRLVAIIIQ 
HWKNCLTKLL PCFLEHNMKR DEDPHKAAKE 
LWPESISWR CVELPEAPVE CEEEEEVEEE 

GIVARLTESL FLbLLGEENG GFCQQDMGES 

AGPKEAPPWG KEQPLHLEPS PPASPTQSPD 
SNSLSQSPCP RELGPDPLLA RHLEEVEPEM 

P- 

EQILRRNVLQ HGAAAAPVSA PTSGYQEFVH 

R I 

AGYKAFSSLL ASSAVSPEKC GFGASSGEEG 
PLFTFGLDRE PPRSPQSSHL PSSSPEHLGL 

S 

DPLVDSLGSG IVYSALTCHL CGHLKQCHGQ 
RSSPPTTPLR APDPSPGGVP LEASLCPASL 
A 

NAQSSSQTPK IVNFVSVGPT YMRVS 



THE XL4Ra PROTEXN 



EPTCVSDYMS ISTCEWKMNG 
GAGCVCHLLM DDWSADNYT 100 
T 

VHTNVSDTLL LTWSNPYPPD 
EPSLRIAAST . LKSGISYRAR 200 
H 

QHLLLGVSVS CIVILAVCLL 
DAQGSQWEKR SRGQEPAKCP 300 
MPFQGSGKSA WCPVEISKTV 
KGSFCASPES SRDDFQEGRE * 400 

A. 

CLLPPSGSTS AHMPWDEFPS 
R 

NLTCTETPLV IAGNPAYRSF' 500 
PCVPQLSEPT TVPQPEPETW ' 

AVEQGGTQAS AWGLGPPGE 600 

YKPFQDLIPG CPGDPAPVPV 
EPGEKVEDMP KPPLPQEQAT 70*0 

EDGGQTPVMA SPCCGCCCGD 
APSGISEKSK SSSSFHPAPG 800 

825 
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SEQUENCE LISTING. 

<110> Genaissance Pharmaceuticals 
Denton, Rex 
Nandabalan, Krishnan 
Chew, Anne 

Stephens, J- Claiborne 
Duda f Amy 
Windemuth, Andreas 

<120> Drug Target Isogenes: Polymorphisms in the Interleukin 
4 Receptor TQpha" Gene 

<130> MWH-0002 PCT IL4R alpha 

<140> TBA 

<141> 2000-07-12 

<150> 60/143,435 
<151> 1999-07-13 

<160> 413 

<170> Patent In Ver, 2^1 

<210> 1 
<211> 28690 

<212> DNA 
<213> Homo sapiens 

<400> 1 

tgtgagctac tgtgtctggc 
atgaggtctc actatattcc 
ccaccttccc gagtagctgg 
tttctgttct ttgtaaatta 
taagctgact tgtaacgtta 
tgtctgtgtt agctcattta 
ccactttata gataaggaaa 
gtcagtgaca gagcaggtat 
ttgtccaagg aaggccacat 
tctggacttt gtgggtctgg 
aggacctgtt gtcggcagag 
caggcctgta atcccagcac 
caagaccagc ctggccaaca 
aggtgtggtg gtgggcacct 
ttgaacccgg gaggtggagg 
gagacagagc gagactctgt 
aatggccagg gcaggggagg 

1 



ctgaataata aaatttaaaa caatttttca aaaattcacc 60 
ctaggctggt ctcaaacccc tggactccaa gtgatccacc 120 
gactagagat gcacaccatt gcacccaata gagcaatacg 180 
cctgctctaa ggtatttttg ttatagcagc ctatatggac 240 
cttgagactt taaagtgttc cggtcactgt tggagggctc 300 
atccccacaa cacctcaatc agatggggct attcttagtc 360 
ctgaggcatg gaagcacagc ttgctcaagg ttcacatcta 420 
ttaaacctca ggaaataatc agagaaacat gtgtagaggg 480 
ccagaagcat ctcccaggac agttgttgtg tagctcaccc 540 
gtgttgtttc atgattatag agagagctct gtgaacgtgg 600 
acacaaatgg ccagggcatg gctgggcagc cgcagtggct 660 
ttcgagaaga ccagaggggc agatcatgag gtcagaagtt 720 
tggtgaaacc ccgtctctac taaaaataca aaaattagcc 780 
gtaatcccag ctactcggga ggctgaggca gaagaatcgc 84 0 
ttgcagtgag ctgagattgc accactgcac tccagccttg 900 
ctcggaaaaa caaacaaaca agcaaacaaa caaacaaata 960 
gttgcatatt gaataagatg agctctgctg gaagcacagg 1020 
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tcagcactaa cctgcttcct ctctctctgc 
tttgctctgg gctcctgttc cctgtgagct 
gtaagtcacc acttctcaat cattcatttg 
agtatgtcac ctggccttat ggagattaca 
tggccctgat tatcagtaaa ttctaaagat 
ggctcacacc tgtaatccca gcacttcagg 
ggagttcgag gtcagcttgg gcaacatagg 
tagccaggtg tggcagtgca cgcctgtagc 
atcccttgaa ctcaggaggt caaggctgca 
cctgcgtgag aaagtgagac cctgtcaaaa 
gaacagcata agagggggtt ggggaagctg 
agggaagtca gggaaggcct cattgagctg 
ctggggaagg ctcattcttg gcagagaggc 
acaggaggga atgtggtgga ggagagtgag 
gtgacagagt ggctgaatca aaaaagacct 
ccattatgtt ccaaaggact cttcaggatg 
aggaggtctg gagggaggtc tggagtggag 
gtagatttga ggacagattg gaattgaggt 
gactccaaga tttctgacct aaactactgg 
gaaggatgcc agaagagaag gtactttggg 
catgagataa gcttggaata tttatttgct 
cttatgcaag ttgttgtcag ctgagatggg 
cctgagtggt ctcatgctca ggtggtttag 
tcctgccatg gccactgaca agctaggagg 
ctccagccca tccatctcct ctctcccatt 
gggtacccgg tgatgaagac tgtggtatca 
gtggagacaa agaggggcgc agatagtggg 
gagggcagag gcttggtgtg atggaatggc 
aatttggcca agaacaagtt gtcaagtttg 
tgtctaatgg cccccgcacc catgggtatt 
gctgggaagc tccagcagcc ctggtcctgg 
ttgcctgcac tgtgcttttg tgctattccc 
acgcattgag ttcctgggcc gctcaggctg 
cttgcaggag cccacctgcg tctccgacta 
gaatggtccc accaattgca gcaccgagct 
ctccgagtaa gcctgcgctg gagctggagg 
ccaagagtga gctgggtcca ggtggtgcgc 
gctgtttata tggtgttaga ggggaggtcc 
gtggagcgca tggtgggagt gttggtccac 
ctagaccagc agttctcaca ctttttggtc 
tgaggaccca aagggctttg gtgtaggtaa 
ttgcaatgaa gaaatggtgt aatttttaaa 
actgaaatgg aggtctcact gtgttgccca 
tcctcctgcc tccgcctccc aaagtgctgg 
caaaatggag aaattttaag tcccaacaac 
tcaattacat gatcacaggt catgtagcct 
agagtgaaaa ggcaaataac gtccctgtat 
cagaccacac tttgagaacc actggactag 



aggtgccttg gcatctccca atggggtggc 1080 
gcctggtcct gctgcaggtg gcaagctctg 1140 
ttggctatta atggcgtgcc agggtcctgc 1200 
ctgcagtggg aggggacagc caatgacaag 1260 
tgttagaaag tgatgggagc cgggtgcagt 1320 
aggccgaggc aggaggatcg cttgagccca 1380 
gagaccttgt ctctacaaat aataaaatat 1440 
cccagctact caggaggccg aggtgggagg 1500 
gtgaactgtg atcgcgccac tccactccag 1560 
aaaaagagaa ggtgatgggg aaagaacaca 1620 
ggtggagtgg gggggattgc agttgaaagt 1680 
acttggagga agcgggaacc gtgcagatgt 1740 
cctgcactga gcctggcggg agggttgagc 1800 
cagcaggagg gagcagtgaa ggtcagcaag 1860 
tgcagtgttt gagcagagga tccatatcat 1920 
ccgtgtggag aaaggaagag ggtggaagcc 1980 
gagatgagag gctccggatc cctctgggag 2040 
gaaagacaga gaaagagaag tggccaggat 2100 
gaaggacgcg gttgtcattt ctgaaatgca 2160 
gaggggcggg aatcaggagt tagttttgga 2220 
atctaagaca gctccttaac atggtaagcc 2280 
cgtggcactg agcatgggag catggaggcg 2340 
caaactcagt gtacatcctg ccaattccag 2400 
gcgctgaaag gagaaggacc ccgatgtctc 2460 
ggccaaaccc aaccggaaac taaaggccaa 2520 
gcctcctgag cacagagagg gcagaaaggg 2580 
caaatgggga agtggcactt cccctagctc 2640 
actccttaaa ctgctacata ttttcccttt 2700 
tgtgagataa aggtgcactt ggttcgttct 2760 
tcttcagctt ccacagtcat cccgacacta 2820 
ccccagctct gtgggcgctg gccctcaact 2880 
cttggtcctg tttgggtgca agtccccctc 2940 
ctcctgtgtc tccccaggga acatgaaggt 3000 
catgagcatc tctacttgcg agtggaagat 3060 
ccgcctgttg taccagctgg tttttctgct 3120 
tttggggagg ttgtgcccaa agggtttgcc 3180 
tggagtgcag gatgctgagt atggtttgct 3240 
catctccagg gacatgttat gtaagataca 3300 
gtggcacatg gatacggctg gaatactgga 3360 
tcaggaccct ttttcacact taaaaatgag 3420 
cacatcattc tatgtttacc taattagaac 3480 
aaattaaaac aattaaaaat tttttttctt 3540 
ggctgctctc aaactcctgg gctccagtga 3600 
gattacaagc gtgagccgct gtatccggcc 3660 
atgcaagccc gcattcaaca aatcttcaga 3720 
ctagaaaatt ccactgtacg ccagtgagag 3780 
tatgatgaaa agagttttac ctggtgggcc 384 0 
acccttgatt gaggagtacg gtgttgagag 3900 
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tggagtcctc tgtgatggtg gatggaccag 
tgggctactc catggtgcac aggatgcttc 
acacaagaca cacagttacg gggcagactg 
caccagtaaa ggtggtattc catgattatt 
ttggcttagt ccagggattg gcaaactatg 
tttttgtaaa taaagtttta ttggaacaca 
tggccctcct ttatgttttt tgtttgtttg 
ttcgctcttg ttgcccaggc tggagtgcag 
cctcccgggt tcaagcgatt ctcctgtctc 
ctgccaccac acccggctaa tttttcgtat 
gccaagctgg tctcgaactc ctgaactcag 
ctgggattac aggcatgagc cactgagccc 
tcagaaatgg aaagtcaaat actgcatgtt 
tacacatggg cattattcca tgtaccatgg 
ggagaggggt gaaggaagag aagttactta 
gaccciaccag aaccccagac ttcaccacta 
gcttgccatc aaaattgcac ttgtaaggcc 
cagccctttg ggaggccgag gtgggcagat 
ggccaacatg gtgaagctcc atctctacta 
gcacacctgt aatcccagct actagggagg 
ggtggaggtt gcagtgagcc gagatcacait 
ctttgtctca ggaaaaaaaa acaaaaacaa 
tacaaataac caaaaaaaaa aaaaaaaaag 
ttaactatct ggcctgt-tac agaaaaagtt 
gggttggctg ttaaggtgga acattggaat 
gagcttatgt tgttctaaat taattagtgc 
gaacaagcta agtcccagga ccagggccgg 
tgcatgtggt aagaggctgt ggccagcaag 
accctggccg ctcccaagcc cccagatctg 
agtctgatgc ggttcctgga ggcatgtccc 
aggtgaccag cctaacccag cccctgtgtc 
aacggaggcg cggggtgcgt gtgccacctg 
tatacact^g acctgtgggc tgggcagcag 
gagcatggtg agcagggcgg agtgcggcag 
gggctgaggg tggggtgggc aggggaggag 
gcctgtattt tcccaaatct gatgggattc 
ctttgaaacg gagctggtcg cagtagacca 
ccctgaactt aagtgcccag gaaggcgtat 
cctgctgctg attgaaaacc gaactgggaa 
ctgctgcggc tttggatggt cttgaccgtg 
ccctgcagga atcccccttg agcttgctgg 
tgtattccag tgatgcatga tatgacatgc 
gatatgattg cttcaaagga gaccaagttt 
actcagtaaa tcatcataaa gtacagagat 
aaagctgaag tttgaggccg ggtgtggtgg 
gccgagccca gcggatcacc ggaggtcagg 
aaccccgtct ctactaaaaa tacaaaaaat 
cccagctact tgggaggctg aggtaggaga 



gacacatggc ataggagtca ggtggttccc 3960 
gttacactgg tgcccaggac ataatcacgt 4020 
gggatatacg gcacaccagc atgcagcgtt 4080 
ctaaggtaga tgggctgtgc tttgtttcca 4140 
gcccgtgagc caaatccggc ccactgcttg 4200 
ctggctgctg tagttgtaac agaaactgca 4260 
tttgtttgtt tgttttcttt gagacagagt 4320 
tggcacaatc tcggctcact gcaacctctg 4380 
agcctcccga gtagttggga ttaatggtgc 4440 
ttttagtaga gaccggtttt catcatgttg 4500 
gtgatccacc cgcctcagcg tcccaaagtg 4560 
ggcctcctcc tttatcttaa ttgaaataat 4 620 
ctcacttata agtaagagtt aaataatgtg 4680 
aataacagac attgaagact tgggagggtg 4740 
atgggcatag tgtacaccat ttgggtgacg 4800 
ggcagcatat ccagtgagaa cagatctgag 4860 
gggcactgtg gtggctcgcg gctgtaatcc 4920 
cacttgaggt caggagttcg agaccggcct 4 980 
aaaatacaac aattaactgg gtgtagtggc 5040 
ctgaggcggg agaattgctt gagcccagga 5100 
cactgtactc tagcctgggt gacagtgaga 5160 
aaaacaaaaa actcgtaccc cctaaattta 5220 
gaaattgtgt ggcctttgaa gtccaaaata 5280 
tgcagacccc tggcctagcc cgtgagatgt 534 0 
tatcttacga tggccaaact gtgcgatgca 5400 
caccggttct tccctttcat gggctttcag 5460 
cagctaggca ggtgtgagga gcatccttgg 5520 
agaggcaacc ctagtcggct gccccagcac 5580 
tcctcacatc cgtgatcggg aagctggaag 5640 
ggacacagct gtggggccca gccagcctac 5700 
tgcagagccc acacgtgtat ccctgagaac 5760 
ctcatggatg acgtggtcag tgcggataac 5820 
ctgctgtgga agggctcctt caagcccagc 5880 
gggtggctgg gtgtgtt ccc acagctgcct 5940 
gtggggtcat agcaacagca ggaggaagcc 6000 
ctgcccctgc ctgggcctca gtcctcccac 6060 
ccaagccccc ttcagcccag ctgtttccac 6120 
tgagatgagg tgtgcttgct ggaaggcatg 6180 
cattccttcc attctgtgtc cactggtcag 6240 
gaaggctgac cttcttctgg tacccggagt 6300 
gctgtggtga caggagttta aaacatgcgt 6360 
atcacaggaa taaaaacctg aggtctcatg 6420 
taaaacagat gaatcaaaat aaagaaaaat 6480 
gtggccaaag gtgtgaagga tgcagctgta 6540 
ttcatgccta taatcccagc actttgggag 6600 
agttcgagac cagcctggac aacatggtaa 6660 
tagtctggca tggtggcagg cgcctgtaat 6720 
atggcttgaa cccaggagaa ggaggttgca 6780 
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gtgagcttag atcatgctac tgccctccag 
aaaaataaaa ataaataaaa ataaaaagat 
ggctcataca ctttgccttc actgtagaaa 
catgcctgga atcccagcac tttgggagcc 
tattgaggct gcagtgagct gggattgtgc 
gaccctgtct caaaaaagaa gaaaaaaagg 
aattttagca acattattgt tattataatc 
gctccagaag tatgctatgg aggttttgtc 
gtaggccggg gctgggctgg gtgatggggg 
gcttctcccg cagtgaaacc cagggcccca 
gacactctgc tgctgacctg gagcaacccg 
ctcacctatg cagtcaacat ttggagtgaa 
ttgacgtttt tctgtgacct ctggggaaca 
ggagccagga gcctgggagg caagccctgg 
cctggcttct cacctggctc tgccctaggc 
tcacatcaga gaaggggagt tatctctgca 
tgggccacat ctgtgtttcc cagtttgggg 
agctcacttg aataacccta ttattgaaga 
gtgtcagttc ccttttgagg ccaacgggtt 
ttactaattc ctcttaatga agagagcaga 
aaaacaagat cagtttgttg gttgttcccc 
gtagtgttgg tggagttttt gttgttgttt 
agggtggaac acagtggcgt gatcttggct 
gattctcctg cctcagcctc ctgagcagcc 
ctaatttttg tatatttagt agagatgggg 
ctcctgacct cagatgatcc actcgccttg 
agccaccgca cctggccagt ggagttcctt 
gggcgagagg atttacagga aactatcagg 
tggctgagtc cccctatttt tctgctggtg 
ttccccaccc tgaatccacc ttcctcacat 
gggacctggg gtcccacagg gctgcaatcc 
gatgttgggc atatcccata acctctttgt 
tgacaagagc acccgcccac agggctatga 
ccctgtttgg cccacatgtg tcctaaaagc 
cacagagatc cccatctgcc cgctgtccca 
ttgtgagggc cggctctgtg catcccaggg 
agcaggcacc cgccacctta agctcccatg 
ggccaagcca ggggtgtcga tttatatctc 
caggtatggg gaaagagcag gaatggggca 
ggggggcatg gggggaagga gtgcacagat 
aataaatctc cttctcatgc cccaggcctg 
gctgggggga agagaaaggt accaccttca 
atatttctag atctgccttc aggcatggct 
tgcgccctgc tttattctgt attggcttcg 
gcagatatgc ctactagttg ctccaggcct 
ttttttcttg acggttctaa taagagtcct 
gctgtgccag cctctgaacc aatcactggc 
ccctgcttgt ccagggagga gtcacatacc 



cctgggcgac agagtgagat tacgtctcaa 6840 
tttttaaaag gctgaagttt gggttacttt 6900 
ggtggttagt aaagaccagg cgcggtggct 6960 
cagcgcaggc agatcacttg agccctgggc 7020 
cactgcactc cagcctgggc aacagagtgg 7080 
gtaattaata aacactaaag ttctatgtag 714 0 
ttctttgcta tggctctgaa tctgtgtggt 7200 
gaccaaaaat ctgggtggtg gctgtggttt 7260 
agtcactgca tagatcctca catagaggcc 7320 
ggaaacctga cagttcacac caatgtctcc 7380 
tatccccctg acaattacct gtataatcat 74 40 
aacgacccgg cagatgtgag tgggcatgct 7500 
gggtgggtga ccagcagagg cccagtccct 7560 
ggctggatag caaatcccag gagctagaga 7620 
aagtcccttt gcttcctggc cccccacccc 7680 
tgccgctcct cctctgtaaa ggtagggctg 7740 
gacacaagtg atcgtaggtg gcacattgac 7800 
gaataatact gactcaagag acagtgaccc 7860 
aaggaggaag tccccataca gctgactcgt 7920 
ggccacaccc ■ caggcttaga ctttcccaag 7980 
atggaagctg gtcctgacat tcccttcaca 8040 
gttttgagac agagtctcac tctgtcaccc 8100 
cactgcaacc tccgcctcct gggttctagc 8160 
gggactacag gcacctgcca ccgtgcccag 8220 
tttcactgcg ttggccaggc tggtctcaaa 8280 
gcctcccaaa gtgctgggat tacaggtgtg 8340 
cttaagtaca tgtattgaca tctttaaaaa 84 00 
tcagtaatgg caggggccgt ccacagtggg 84 60 
tgcagggagg tcatttcctg ccacccatgt 8520 
tcccattgga gggacaatct ctggacatat 8580 
aatgcctgct gtgccactcg ccagctgtgt 8640 
gcctcagttt cctcatctgt aacacaggag 8700 
cagtacaagg tgtgtgatac agatgagctc 8760 
catgtgccct ttctcttgag tgccccaggc 8820 
cacactggtc tgtcatttgt tccttgaggt 8880 
gcccaggctg ggcctggttg gctctcaggg 8940 
ctggtgtctg tcactgcttc ctctcaatct 9000 
tcaggtctgg tttccccttt ggcactgggc 9060 
gttggctcac acagcagagg ctcagaaagc 9120 
gctagagagt ggggcaagtt ttgtttggtc 9180 
tgcaagacct acagagagtc ccaaggatgg 9240 
gagtccaaag atatgttatt taatattttc 9300 
ggatccagct tctaggaacc tgtccagctc 9360 
tttttaggca ggctcttccc tcatgtagtg 9420 
acatcccaaa gccacagtgg gaaaagggtt 9480 
aaggctgctg ctcagtggcc tggcttcgat 9540 
tgtgggtgga gagagggtgc tggtggaggg 9600 
tgcctctagg gctgcaggtg ggctcagctc 9660 
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catccaaacc agatgaactg aaaataaggc aggagtggct tccccagggg aaactgggga 9720 . 
agaggaagca ggactgtgct ggctaaaatg ccagccaggt ttaagacgtg gcaccagatg 9780 
ccagtcatgg gattggattg gtcagcatgc ctgggctatg gcttaggggt atgttggtgc 9840 
tcagggatgc cacaggcctc cagataccag gtctgaggca gaagaatgaa gtccagcttc 9900 
tcttgtgggt ggaacagtgg caactgagat accccatctc tcccttccca agaacagagc 9960 
tgaacataaa gaatttagtg attggccaga gcttggccac atgctcccct ctgatgaatg 10020 
ataggccagg tgatgggatt ggcacaattg gcttagacta atgagggttg gccctggagt 10080 
tgcaggcagt ggagttctgt cctaagcagt gggcacctaa acccgatggc ataaaagctg 10140 
ggcgggtgtc cacctgcatc tgccacagca ctataggcac caactgtggc tcatactgag 10200 
tgggataaat tccagaaaga aacattagga acttactata gaattttggg gctagagcta 10260 
ctcattcatt cccctagata atttctaggc aaggttccat agtggagggg gagttttggc 10320 
ttgggcattg aaggatgcat aggagttttc tagatgggga aagaagggaa cggtagacca 10380 
ggcagaggga actgcatgat aaaaggttta tgggtgtgaa aattcatgga atgtttgagg 10440 
attatggggt tgggggatgt gggaatatgt gtagcgataa agcaccaaac aaagccaaaa 10500 
gtttagttag agccctgaat gcctgcctca taatggtttc catattttat atgcctacta 10560 
tgtgccaggc acattgctca gggtcacaca gctggaaatg gcagggctga gtttttgttg 10620 
ttgttgttgt tgttgagaca gagtctcact ctatcaccca ggctggaatg caggggcgtg 10680 
atcatggctc actgcatcct tgacttcctg ggatcaggtg attctcccac ctctgcctcc 10740 
caggtagctg ggactacagg cacaggccac cacgccaggc taattttttg tatttttagt 10800 
agcgacaggg tctcgccatg ttgtccgggc tggtctggat ctcctggctt caagtgatcc 10860 
ccctggctca gcctcccaag gtgctgggat tacaggcttg agccaccgca tccagcccag 10920 
atctgagatt tgcacccagt atttgaactc ccaagcctgt gctctttttc ctcccatgga 10980 
catttctctc agagatggtc tcccaaacac ctgtccttct tgttaaaaaa cagacaaacc 11040 
gcaagtagtt ctttggaagc tcagatttct cttttgtttc ttagtaaaac atttcccagt 11100 
tcccagctcc cttccagggt gtaagatttc ttcggtaact tacatctagc tgttgcttct 11160 
tgtttgctca tgtttagaaa gaaagacaaa agagagtgag aattttctct cccttcccca 11220 
gtctccccac aactcacacc ccaccctcag ctccctctgt aataggaaaa tctctgaact 11280 
ctctgtagtt gctccagcaa tcttttggaa ctttgcttct ttcttgtgaa aaaacctccc 11340 
cttggctcac tttgcaccag gtttccccaa atgtgcttcc aaccacaagc agaaatggag 11400 
ctgccagtaa ccaggaagaa actgccgggg gctgaggaag aggagaggga • ggtgcatagc 114 60 
cctggatctc gcagggagag ggg.tgacagg atgagaactc aggttgctca cttgccatca 11520 
gggtcagtca tgaatatagc gttcatgtat cactttttaa agcttttttg gagggtaaaa 11580 
gtaatagtta cacaaaataa aaatacaaat ggtacaaaag gacttagaat ggaaacatgt 11640 
ttctctcccg actccagcct cctgtttttc ttcccagaga ctgaccactg ctgtctgtct 11700 
cttgccagaa gggaaaggga ggcaaggtta gggcaggcag agggcatgtg catcctttag 11760 
agagagctta tgtctataca agcaaatgtg tgtgttcagt catcgctgtc ttagttttct 11820 
attgctgcat aataatggta ctaccagctt cacagcttta aacaacaccc atttattatc 11880 
tcatagtttc tgtggttggg agtctggaca tagcttagcc aggttctctg ctttagagtc 11940 
tcgtgaggct ataatcaagg tgtgggatgg ggctgcagtt tcatctgagg ctcaattggg 12000 
gaagggtcac ttctaagctc atacaatatt ggtgacattc agtcccUggc aggctgttga 12060 
actgagagcc tcagtttcgt gctggctgtt ggttgtagtt aaccctgaat tccttcccat 12120 
gtgccctttg caaagccatc aaggcagaga gacttgccta gcaagtagga tattacagtc 12180 
ttctgtaata taatcacatc catgaaatcc tctatatatc ccatcacctt taccatattc 12240 
tgtgggttag aaacaagtag caggtcctgc ccacactcga gaagaccaga tgacacaaag 12300 
atgtgattca aagtggggat catcggggcc atcttaggtt tgtctgcagt gatcactgtg 12360 
ccatctctct ctctctcttt tttttttttt tttttccgag acgaagtcgt cactctgtca 12420 
cccaggctgg agtgcagtgg catgatctca gcttaccaca atctctgcct cccaggttca 124 80 
aatgattctt ctgcctcagc ctcctgagta gctgggatta caggtgcccg ccaccacacc 12540 
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cagctaattt ttgtattttt agtagagaca 
gaactcctca cctcaagtga tccacccact 
atgagccacc atgcccagcc ccatctctct 
aaaaagaagc agagaacaca tacacatctg 
ttggaagtca cttctcagta gaggctaggt 
gtttggactt gaccatggag acactaggaa 
ccctgccagt cctttagaaa gatcacccag 
tacttacgta tattaaccca tttccttatc 
cttttattat cttcattttg cagatgagga 
cttaaggtca cacagccagg aagtggtagt 
attgaattat ctcacgaaaa ctcagaaagg 
aattagtcga ggagcctgac gcatgttgct 
gcaatggcct gaacaggacg aacaaccaaa 
gggtcagcta acgacagcaa ccagggtggg 
ctaacatctc ccttttctct accagttcag 
cctccgcatc gcagccagca ccctgaagtc 
ctgggctcag tgctataaca ccacctggag 
ctgtgagtat caagaggcct aagcaatggt 
cagacacttc ccctggctga gtctctgggc 
tcctgccatt agatacacct gctgtggtgt 
aacacacaag cccacaattt ccatggctta 
aacaagctaa cgtgaatgtt gctggtttgt 
gagtgaggtt ctttccatct tgtggtgcca 
ttttgtgtcc tggtttcctt tggcagcctg 
ttttaaatac ataaaatccc agcctgggca 
tagccaggca tggtggcatg cacctgtagt 
atcacttgag cccaggagtt tgaggctgca 
cctgggtgac agagcaagag cccatctcta 
aataaataga actacagagg aaactaattg 
acatttttaa tctagagata tatgtgcttc 
ggggtagctc gcataaatac tgtaatttca 
tactgaaatt gtgatgtgat atgagaatag 
cactgatgct gtcattactg gggtctcttc 
ttttattttt aaaattttaa aataaataga 
ttttgacctc ctgggctcca gtgatcttcc 
aagtgggagc cagtgttcct ggccccttcc 
tgagaggtta gggaaagtaa agatgtaatt 
attctacctg. cagccatgtt ggtccatcaa 
ccaccattgt ctgtatccag ccagcagaag 
ttgaaagcag acagcccgga agtgggccgc 
atggtcacat gactacacat agccacaaag 
catcccagaa acaactcttt tcagtgaggt 
gattatctca cataaaatga agtcttacaa 
agtgatatct cttcccttct gctgtcttcc 
atccttagac ttgccaccca tgccctcagg 
aggaaggaaa ggaaaggctg ctacgagaga 
gaagcaaaca tctttctaga agcttcccta 
gactgttaca tgttacatgg ttactgttat 



gagtttcacc atgttggcca ggctggtctt 12600 
tcggcctccc aaagtgctgg gattacaggc 12660 
ttaaaaaaca aacaaacaaa caaaaaacat 12720 
catcttccct tgtttactta acaatagatc 12780 
tgggcagagc attggattct aggccagtga 12840 
gcccatgaag gacagagaga gatgcctcga 12900 
tgctttttgt ataccaaacc ctatttgaaa 12960 
accacaaccc tgcgggaagg gagataggca 13020 
cattgaggtc cagagaggtt atgtcactta 13080 
agggactctt acccttgttt tacagatgag 13140 
ttaaacaact tgcctaagta acatacagct 13200 
ctagcctggt cacagttaca gaggtggcaa 13260 
tacccaggct ggtggctctt aaacatggtg 1332.0 
cactggtgcc cctcgccccc ggctggtgcc 13380 
aatctataac gtgacctacc tagaaccctc 13440 
tgggatttcc tacagggcac gggtgagggc 13500 
tgagtggagc cccagcacca agtggcacaa 13560 
aatctccact ctccattctt cccctgtggc 13620 
ttttatatca taggatgcct ctaatggcaa 13680 
atctgccagg taggcaggct aggctgcagt 13740 
acactatagg aatatatttc ttgctcatgt 13800 
aggtggtttc cctccctgta gaaatctggg 13860 
tcattctcca ggacaaagat tcttacctac 13920 
gtgaagccta tggacctcat ttcagaatat 13980 
atatagtgaa acccccatct gtacaaaaat 14040 
cccaggtact gggaaggctg aggtgggagg 14100 
gtgagccgtg atcgtaccac tttactccca 14160 
aaaataaata aatacaatga aataaaataa 14220 
tattgaaatg cagttataaa acatttaaac 14280 
tttattaaga tctataaata ataagttcta 14340 
aagtagataa gcataaataa tactttatga 14400 
ctgtgagttt tgttttgctg gggacaggat 144 60 
cctccattct ttttttaaaa ttgtatttta 14520 
gacagggtat cactatgttg cccaggctgc 14 580 
catcttggct. tcccaaagtg ctgggattac 14 640 
tccattctta atggaaggag atgctaggtg 14700 
tctttcccat ccaagttctc agacccctga 14 760 
ccccaagtga agaatccctg ctctagggcc 14820 
aggcgtgatt atggagatca catctgcttc 14880 
atcacttcct ctcaaattct attggtgaaa 14 940 
gaggctggga actttctcac ttggaaccta 15000 
atcccacagg tctttcgcag "tagaaatatt 15060 
atggacctac tgggttttgt acagcagcca .15120 
cttctgccgt ccttcacatg gtggcattgt 15180 
ttggccgttg cacactgtct tacataaagc 15240 
gtgtaccttg tgcatctctt ttttaatcag 15300 
gcaaaattcc ccttacatct cattggccaa 15360 
tacttgctca ttgcaaggaa gactgggaac 15420 
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tcaaatgcct 
ggctccctga 



ggaaaaagga 
caagggagag 



acaggataat cgtgattggc 
aggaaaaagc tgttgagtga 



tcaagcctta 
agaagactgc 



gggtgggcat 
ttcagtttcc 



15480 
15540" 



ccatctgtat aatgggagga gtaagggctg tcgtgaaaac tcaatgaaag aagattcttc 15600 
aacgtggtag gtgcagtggc agctggcagt accctgaccc tgccaccgca cagccctctc 15660 
agcattgctc atcctgcact gtggatatca gttgagccac gtgtctcctg ccctgggctg 15720 
tgagctccat aggcagggtc tccatggctg tatctccaga acccagcaca gaaccaggtg 15780 
cttgggaaag ttttgaattg attctcatct gccattggca tggggaaggg aactagcttg 15840 
tatgaaacag ataacaatgt atgggaccct cattcattat ttcagcaaat atttgctgag 15900 
ttcctcctac atggctagcc ctgtgctaga cactggggaa tcggcgatga acaaagcaga .15960 
tagaaatccc cactcttgtg gagctgacat tctggaggga gagacaaaaa gcaaacatat 16020 
aaagaaagaa agaaatcaca tggatctgga tgacagtgag tgctgggaag aaaataaaag 16080 
cagaggaagg ggatggagcg atgggcaggg ggcaacggta gggagggtgt cggggaaaac 16140 
tttttggaga atgtgacgat gaaagtgaac aaggagaagt caaccgtgtt gagatgatgg 16200 
cagctaatga tgtggacagg ccactctgtt ctgagtgcat tatctattga ttcatcatgt 16260 
catcctcgca acagccctgc acgatcaatt ctgtcattaa ccccatagta cagatgagga 16320 
tgcggaggca cagagaagat aagggacttg tcctgtgtca cacagcaagg agccatccgg 16380 
ctcctaagtt ggtgcatttg acttctgtgc ttccggaaag aaagagcagc aagtttaaga 16440 
tctggaggtg gcactgagct ttggaggagc agggggcaat gaggtggccg gtgtgacgag 16500 
gactcaatgt gcaagaggga gagtggtggg gagatgaggt ggaggggtgg tcggcggtca 16560 
gatcgtggag ggtctcggac gagggtcctg accctgggtc tccagtcctg ggaagtggag 16620 
cccaggctgt accatggctg acctcagctc atggcttccc ctcccacttc cagcctacag 16680 
ggagcccttc gagcagcacc tcctgctggg cgtcagcgtt tcctgcattg tcatcctggc 16740 
cgtctgcctg ttgtgctatg tcagcatcac caagtgagtc ctgggcccag tgctgccgag 16800 
cagtccctct ggagtgcagg gtggcaggga cttgcccctc tagtctgccc ctttgcagtc 168 60 
ctctcagtca ataatacgta tttactgagc agctactaca caccttgaga gtagagctga 16920 
gaacatatcg acaaggaccc cacttttttc tttttttctt tttttttttt ttttgagacg 16980 
gagtctcact ctgtcaccca ggctggagta tagtggcaca atcttgccta acagtaacct 1704 0 
ccgcctcccg ggttcaagca attcttctgc ctcagcctcc agagtagctg ggattacagg 17100 
cgcatgccac tatgcccggc taattttttg tatttttggt agagatgggg tttcaccatg 17160 
ttggtcaggc tggtctcgaa ctcctgacct catgatctgc ctgcctcagc ctcccaaagt 17220 
gctgggatta caggtgtgag ccactgcacc caaccaggac tccacatttc taaaaccggc 17280 
atcctactgg ggagactgaa aatacatatc aatcacaaac aggtggtttt ccatagtgac 17340 
ccactctctg aatgcactag accagggtgg aggccagaga tcttctgggg tgctttttgc 17400 
aagggggacc aggataaggc tctccaagga gggaaaattt gaggggggcc ctgactgggg 174 60 
agaatgagct ggccagggat aagcaagatg gagtcatccc acatcccctt acaacactgg 17520 
gtgcctgggc aactgggggc atttgggggc atgtggtagg agccagagga atttgcgacg 17580 
attgccctga tggagtcagg agacctgggt ttgaatcctg gccttggagc ttggtagctg 17640 
gcggccgaca agttgctgaa acccctgagc ctggggttcc tgctttgcag agtgacagtg 17700 
atggtgagaa catatttcat cagccagaag aggccaaatc acagtaaagg ctgagggagg 17760 
agatgagtgg cgagtggctg ggaggtggtg gaaggagcct cgtttccaga • gagctcttgc 17820 
cagcccttgg aatcatggtg tctcagagcc tcagtcctcc catctctgaa atgggactag 17880 
caagctcaac ctcactaagt caggattaga ggtggctaag gattattaac atgattgatg 17940 
aaagtgccca ctcttggccc agcacacact aggtaggcag ggaatgcaaa ttcccctcca 18000 
tatcttgtca ctgatgcctc cgagcaacct tggactgatc gccttgctct gagcctcagt 18060 
ttccccatca cctgtacctc ttcccactcc ccatcactat atcccagcat gccagcctct 18120 
ttgctgttct ttgtctttgg tttcttgttt tgttctgttt tttagacagg gtctcactct 18180 
gttagccagg ctgaagtgca gtggcgcggt tacggctcac tgcagcctcc aattcctggg 1824 0 
ctaaagagat cctcccattt caacttccag agcagctggg acaacaggcg cttgccacca 18300 
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cacctggcta attttcttat tttaatttaa 
tctcaaaaac caagctagag tgcagtggtg 
cgggttcaag cgattctcct gccttagcct 
acgacaccca gctaattttt gtatttttag 
atggtcttga actcctgacc tcaagtgatc 
acaggcatga gccactgtgc ctggccaatt 
ttatgtagcc caggctgatc ttgaacttct 
accatgaatg aatgacctca tataagcatt 
tggagtctca tgttgtcccc caggctggag 
ctccgccttc cgggttcaag cgattctcct 
ggcgcctgcc accatgcctg gctaagtttt 
attggccagg ctggtctcga acttctgacc 
agtgctggga ttacaggcgt gagccgccat 
-atttatcatc catctttccc tcttgaaggt 
gttcactgtt gtagcctcag ggccaggcac 
atatgtgttt atttattgaa accatgggca 
tacctcccag tgatgatgca ccatggcccc 
ttctccgtag gtcccccagc ttcccaggcc 
cagctgctct gaaaaaggga ttgttgaggg 
tctttcctaa tcccctgggg tctcattccc 
tctgtgctgg gtaacagaat gcgggagtgt 
ctccacctcc ttgggcctca ctctcctcat 
gtcaccaggt tagaatgatg agtcagtggt 
ctgatgccaa ataagtattg gtgataacga 
aataattcag agatcaacac cttttccttt 
atcagattcc caacccagcc cgcagccgcc 
taggagtagg cgtggatgag gacatgtggg 
cacctgggct gttaaggacc ttcactggct 
ttgcagggga gacagaggca gaagccgaat 
cccttcccct tggcaatccc agcctggggt 
ttccctcccc ttgggagaat gacccttggg 
tgccagtacc agggctagag gccagaagga 
gtgtcttctt cttaagcata ggaaactgcc 
ctcattggcc tgaaatgtat cccaccccta 
gatggtcctg tgggtgagtg ggtggggttt 
gagaaagggg gattataggt cctgtctaga 
gggactctgg aatctagtgt tggctttgaa 
acttgagcaa gttgcttaat tctctgagcc 
ttacctggta ggacagatat tggaatttat 
gcctcttgca aataccaagt gccatttatg 
accccgaaat gtagagggtt acaacaactt 
ttggggctgg gctcagcagt gagggtggcg 
atctgaattg agctgacccg tccccgtagc 
tttttttttc tttttctgag acggagtttt 
atcttggctc actgcaacct ctgcctcctg 
caagtagctg ggattacagg catgtgccac 
agatggggtt tcttcatgtt ggtcaggctg 
ccacctcagc ctcccaaagt gctgggatta 



ttttatttta ttttttggga cagagtggag 18360 
cgatctcgac tcactgcaat ctctgcctcc 18420 
cccgactagc tgggattaca ggcgtgtgcc 18480 
tagagatggg gtttcaccat gttggccagg 18540 
cacccacctc gttctcccaa ggtgctgggt 18600 
ttcttacatt ttgtagagac tggctgtcac 18660 
acccctttat ctttattcat ggcacttatt 18720 
tctttcgttt tttttttttt ttctttgaga 18780 
tgcagtggcg cgatctcagc tcactgcaac 18840 
gcctcagcct cctgagtagc tgggattgca 18900 
gcatttttag tagagacggt gtttcaccat 18960 
tcaggtgata cacctgcctt ggcctcccaa 19020 
gcctggcctc atataagcat ttctgtctcc 19080 
cagtttcacc aaggcaggca tctttgtctc 19140 
agtgagtcaa acatagaagg tgctcaataa . 19200 
gaggctaatt cagaagcggt ctgaggacct 19260 
aggcaggcca ggaagagaga agggttgtgf 19320 
atcccaggcc attccct.ggt catttgccct 19380 
gaacctagaa tcctctctct gcagtttgag 19440 
actgaggaca taggtggcct cctcaggaac 19500 
gaacctggct ctgccaccta ccagctgtca 19560 
ctgtagaata gggttagcaa tagaatccat 19620 
ttgacctcca gaaactaatc agcctgatct 19680 
ccacttttat gggaggagcg ttcacctgtc 19740 
tgtttttcag gattaagaaa gaatggtggg 19800 
tcgtggctat aataatccag gatgctcagg 19860 
actgtgtaca tgaagaagtg tggttcagaa 19920 
tctggaatgg caaatagaca gtcaggaggg 19980 
gaggtcatta gcagaccaga ggctttcccg 20040 
gggcttctct ggggttggtt tcctgttttt 20100 
tcatcatcac tgtgtcattc cctggggagg 20160 
gtggaggaag gagagggtga caggctttct 20220 
cccgaagcac tagcaaatcc cttccgggtt 20280 
agccaggggt ggagtcagct tccccaaggc 20340 
gcctgagcaa gatgagagtt ctctaggtag 20400 
agagaaggtc" tgagggtcct tgcttttcca 20460 
tcctgactct gccactcact ggcagtgtgg 20520 
tcagtttcct cttgtgggtt ataacagtgt 20580 
tgagacaata catataaagt gcatattcca 20640 
tatcagttag tgtttgctgt gtaacaaatg 20700 
tatttagctt atgcttctgc aggctggcat 207 60 
ggggaggctg ggctgggctg ggctgggcag 20820 
ctccctccgt gtctgacagt tggctttttt 20880 
gctcttattg cccaggagtg caatggcatg 20940 
ggttcaagca attttcttgc ctcagcctcc 21000 
cacgccaggc taattttgta tttttaatag 21060 
gtctggaact cctaatatca gatgatccac 21120 
caggcgtgag ccactgcacc cagcctagtt 21180 
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ggctgacttt tacctgggac agtgcaggtg cctgagccat gtgcctctca ctctccagca 21240 
ggccggccca ggcttgttta cagagtggct cagttttcaa gggtgggaag tcccaaggct 21300 
tcttgaggcc taggcgcagc actggcatga tatcacttcc atcacattct atgggcccaa 21360 
gcaagtccca gggccagtgt agattcaagg gatgggagga gattcagagc actcctctgt 21420 
ggccactttt gccatcgacc acagtccctg taaatattag gacaatgtaa ttaattccca 21480 
ggaatctgag gctcagaaag cgtaagtgac ctgttggact tctgatctgt gtgatgtcga 21540 
ggcttgtacc ccttcctgag cattgccgta ctccaggccg "ggctgcaagg ccactctgct 21600 
ctttcattgg ctgtctctgt attttagggg tcacagtggg agaagcggtc ccgaggccag 21660 
gaaccagcca agtgcccgta tgtatctgaa cttaggtcac agcctgcatg cattgggaag 21720 
gtgatagaat tggagaggca agcccctagc tccatgtctg ccttctcttc cctgcattcg 21780 
gtaattgccc tgtgacatta gccttcaagg gacggcagga ggaggggtgt tctggaaacg 21840 
tggactgctg gccaagcccc ctgagtttca ctggtgtgtc aggtacatgg tgatacccct 21900 
tgggagtgct gttatagtta acaaccagag cagccgtgcc tgttgttaaa atcttgacct 21960 
aattgtatac ttgtcggcaa atagccacta tcctgaacac tcccctcctt ttttttaata 22020 
tacaggatct cactctgtgg cccaggctgg tgtgcagtgg tgcgatcata gctcactgca 22080 
ccttcaaact cctgagctca agtgatcctc ccatcttagc ctcccgagta gctgatacta 22140 
cagatgtgca ttaccacgcc tggctatttt aaaaggtttt tgcctgtaat tccagctact 22200 
caggaggctg aggcatgaga atcacttgaa cccgggaggc agaggttgca gtgagcgcag 22260 
attgtgccac tgcactccag cctgggcgac agagtgagac tcttgtctca aaaaaaataa 22320 
taccaaaaaa agtttttgta aagacaagct ctcgctgtgt tgccccgcca ctgtggcctc 22380 
cttagcttct tccctggggc ctgctggacc tttccatact ccagaaacta aagggggtcc 22440 
aggaccctgc ttcaacccta ggatcccgca tctttttttt tttttttttt ttttggacgc 22500 
agggtcttgc tgtgtccctq aggctggagt gcagtgattc actgcagcct caaactcgtg 22560 
ggctcaagtg attctctagc ctcagccttc taagtagctg ggactacagt catacaccaa 22620 
catgcccagc taattttcct tttttttaat tcttgtagag atgtttgaga cggcttgggc 22680 
tctgttgccc aggctgttct caaactcctg agctcaagcg atcctccctc ctcagcctcc 22740 
taaagtgctg ggattacagg cgtgagccac cgcacccggc ttccatatcc tttctaattg 22800 
gtcatggctt gggataatgg tgttgctttt aattatcatc atccataaag actttttctt 22860 
actcaacaga tctgagcttg tatttggtgc ccaggacatg tgctgggttc ccgaaatccc 22920 
^aagacacag accctaccct cagggatttc tcattctagc aacatagact gatcaattac 22980 
tgattataac gttagaaggc atgtctgaag tagacagcca tcaggacatg gtgatttcag 2304 0 
gctgggcttt gaagaatgaa taggagtttt ' tcaagtgtcg aaactgaacc ctgaccaacc 23100 
tttgcttttg cagacactgg aagaattgtc ttaccaagct cttgccctgt tttctggagc 23160 
acaacatgaa aagggatgaa gatcctcaca aggctgccaa agagatgcct ttccagggct 23220 
ctggaaaatc agcatggtgc ccagtggaga t-cagcaagac agtcctctgg pcagagagca 23280 
tcagcgtggt gcgatgtgtg gagttgtttg aggccccggt ggagtgtgag gaggaggagg 23340 
aggtagagga agaaaaaggg agcttctgtg catcgcctga gagcagcagg gatgacttcc 23400 
aggagggaag ggagggcatt gtggcccggc taacagagag cctgttcctg gacctgctcg 234 60 
gagaggagaa tgggggcttt tgccagcagg acatggggga gtcatgcctt cttccacctt 23520 
cgggaagtac gagtgctcac atgccctggg atgagttccc aagtgcaggg cccaaggagg 23580 
cacctccctg gggcaaggag cagcctctcc acctggagcc aagtcctcct gccagcccga 2364 0 
cccagagtcc agacaacctg acttgcacag agacgcccct cgtcatcgca ggcaaccctg 23700 
cttaccgcag cttcagcaac tccctgagcc agtc$iccgtg tcccagagag ctgggtccag 23760 
acccactgct ggccagacac ctggaggaag tagaacccga gatgccctgt gtcccccagc 23820 
tctctgagcc aaccactgtg ccccaacctg agccagaaac ctgggagcag atcctccgcc 23880 
gaaatgtcct ccagcatggg gcagctgcag cccccgtctc ggcccccacc agtggctatc 23940 
aggagtttgt acatgcggtg gagcagggtg gcacccaggc cagtgcggtg gtgggcttgg 24 000 
gtcccccagg agaggctggt tacaaggcct tctcaagcct gcttgccagc agtgctgtgt 24060 
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ccccagagaa atgtgggttt ggggctagca 
acctcattcc tggctgccct ggggaccctg 
gactggacag ggagccacct cgcagtccgc 
agcacctggg tctggagccg ggggaaaagg 
aggagcaggc cacagacccc cttgtggaca 
ttacctgcca cctgtgcggc cacctgaaac 
cccctgtcat ggccagtcct tgctgtggct 
caacccccct gagggcccca gacccctctc 
gtccggcctc cctggcaccc tcgggcatct 
atcctgcccc tggcaatgct cagagctcaa 
ccgtgggacc cacatacatg agggtctctt 
agatgaggac tagggcttat ccatgcctgg 
tggcagattt ccaaaagact tgaagaacca 
tggcctaaca ctgggctgca gagactggac 
tcccatgaga gtagagggca ctgggtcgcc 
tgaggccctt gggcacctcg acttgtgaac 
gcagcagact gtccctgttg taactgccca 
acatggaggc ccacctgcct ctgtctcact 
agccatcaag ggaatgactt gggcggcctt 
agggtggtca ttgcctagag gtgctcattc 
gaggcagaat cccggctgtc aaggggtgtt 
aattgctgtg actaaagcag ggacaatttg 
tgggggctcc tcgtatgcat ggaaccccca 
gggcaatcca gacagcaggc ataaggcacc 
gtgctaggga aggcgggaac cttgggttga 
cacctgttat ctgtgtttgc tgaggagagt 
aaagtttctt tgtctcttta ttttttatgt 
tgagtgctgt gtctctgtta actcctggaa 
aggttaagaa acttgccgtc tgggtttggg. 
taggaagtaa tcccgggaaa ccctgctaag 
aaaccaagaa agtgttacct gaaaggggtc 
atctcatgca agaaagaatt cagagcgagt 
gaaagtaaag gaataaaaga atggctactc 
ctgcctattt ttatggttat tgattaatta 
tcccttttag accatatagg gtaacttcct 
ggcgctgttg ggagtgtagc agtgaggaca 
tttggtgggt tagagccatc ttctttactg 
cttgtatcgg tgacgacctc ctgtctcatt 
aatgcagccc agtaagtctc agcctcattt 
aataaacctc tgacaaaagg gtgagttatt 
ttacctgccg gggatctctg gaagaccatg 
gagagggagc tggggtattt gtccaccagc 
caggagcatt aattctccag cacttccagc 
agagttggag gtgttgagag actctggcac 
tggctgatga tacactgcca aggtcacaca 
cagacagtgt ctccatcacc caggctctct 
gcagagggat ggtgtgggcc ttagatggga 
atcacttttg cttctctgag tagctgccct 



gtggggaaga ggggtataag cctttccaag 24120 
ccccagtccc tgtccccttg ttcacctttg 24180 
agagctcaca tctcccaagc agctccccag 24240 
tagaggacat gccaaagccc ccacttcccc 24300 
gcctgggcag tggcattgtc tactcagccc 24 360 
agtgtcatgg ccaggaggat ggtggccaga 24420 
gctgctgtgg agacaggtcc tcgcccccta 24480 
caggtggggt tccactggag gccagtctgt 24 540 
cagagaagag taaatcctca teat cct tec 24 600 
gccagacccc caaaatcgtg aactttgtct 24 660 
aggtgcatgt cctcttgttg ctgagtctgc 24720 
gaaatgccac ctcctggaag gcagccaggc 24780 
tggtatgaag gtgattggcc ccactgacgt 24840 
cccgcccagc attgggctgg gctcgccaca 24900 
gtgccccacg gcaggcccct gcaggaaaac 24 960 
gagttgttgg ctgctccctc cacagcttct 25020 
aggcatgttt tgcccaccag atcatggccc 25080 
gaactagaag ccgagcctag aaactaacac 25140 
gggaaatcga tgagaaattg aacttcaggg 25200 
atttaacaga gcttccttag gttgatgctg 25260 
cagttaaggg. gagcaacaga ggacatgaaa 25320 
ctgccaaaca cccatgccca gctgtatggc 25380 
gaataaatat gctcagccac cctgtgggcc 25440 
agttaccctg catgttggcc cagacctcag 25500 
gtaatgctcg tctgtgtgtt ttagtttcat 25560 
ggaacagaag gggtggagtt ttgtataaat 25620 
attaaccaaa catacctcca gacactgctg 25680 
ttcacccatc cagaggaacc aggatgcaag 25740 
ttccccatac aaggattcaa atagttgatt 25800 
gtagtgggga actgaggcag ggaaggacac 25860 
cagatgcaga ccccaaaaga gggttcttga 25920 
ccatagagtc agtgaaagca agttaatgag 25980 
cgtagacaga gcagccctga gggttgctgg 26040 
tattccaaac aaggggtgga ttattatgcc 26100 
gatgttgcca tggcatttgt aaactgtcat 26160 
accagaggtc actcttgttg ccatcttggt 26220 
caacctgttt tatcagcaag gtctttatga 26280 
ctatgactaa gaatgcccta acctcccagg 26340 
tacccagccc ctcttcaaag • ctccagttta 26400 
caacagatta ccagcatgag taactgatgc 264 60 
catggcacat gcccagttat gcctgcaaag 26520 
tcccatctgt cattggctga gagctgcttc 26580 
tactccagga aaaaaaaaat tcttcaactg 26640 
accaagaaga caggaacagg acaccaacag 26700 
gctagttagc aacagatcta tagtggaatc 26160 
gtagtgatct gcgcttcaca tccgaggcag 26820 
aggctgggaa cctgaagctc ctatgtctgt 26880 
gatttcacac ttgaggggct tggccatttt 26940 
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agattccttc ctgctctagg agcctacata 
ctcacatgca gcctgatgtt tgttagaaac 
gctgtaatcc cagcaatttg ggaggctgag 
agaccagcct ggccaatatg gtgaaaccct 
gtgtggtggt gggtgcctgt aatctcagct 
caacctggga cgcggaggtt gcagtgagct 
gacagagtga gaccctgtct caggaaaaaa 
ccaaaatcaa tccctttagc tgcccaaatc 
gagaggaaaa gggacttgcc caaagtcacc 
tggacttctt gcctccaaaa gctctttata 
tctgtttagg gccaagcaat atgctaagtg 
aaacagctct agttgggagg ctcaatgatt 
cagagaggtt gaggattagc ctagaaccac 
aacccgggtc tgacctaaga gctcccagcc 
cacctacgca gatgtcggct cgaatccact 
tttaaaaatt aggcagcaaa tagaaaatat 
gagcccaaca aaggatctta ggcaaaagaa 
tgaactggct gggctttggg aagctacaga 
cccaactcag gctggccccc accctgcagg 
gtgagatgcc ggctgcggga gtttcagcct 
gcccctgggt agcagcgcca gagtgggctg 
agagatgttg atttgggggt actccctcag 
gaggggaggg ggaaaggggg gaggaaggca 
tacaaaaagt ggcttattgc atttttctga 
tattcagaga aagcctgaag atataaagag 
tccccgcccc agcatctggc actgtgtggg 
gcgtagtatc tccgtgaaca tccggttgaa 
agttattaat taaaaaacaa aaagcaaaac 
tgagctccca ccacgaggga ggcctgacgt 
gcatgggact 



ctacactgga aatgatgggg agctctctac 27 000 
acctccttgc gccaggcatg atggctcatg 27060 
gcgggtgtat cacttgaggt gaggagttca 27120 
atctctacca .aaaaataaaa aattagccgg 27180 
acttgggagg ctgagttggt agaattgctt 27240 
gagattgtgc cattgcactc cagcttggat 27300 
aaaaaaaaac aaaaaaaacc ttgttctaag 27360 
acacagttta cagatggaga' aacagtttta 27420 
cagagaatgg cagagcctga actagccttc 27480 
ataaaatata attttaaata aaaatagtta 27540 
ccgtccagcc actgtgtcat ttacgtctcc 27600 
atcccaattt tacagataag gaaacaggtc 27660 
acagctagga aatcctggag ccaggatttg 27720 
gccgtgatat atcagcttat gtcatcctga 27780 
ttgcctgagc attgtctcag agaaatctaa 27840 
atttgactgc tagagatgca atgggactgg 27900 
atccaagttg ttggcctcag .caactattac 27960 
gggatgagaa gacctggtgg atcaggtggg 28020 
aagtaggaaa agtccagggt cataggccca 28080 
ccggggctgg accagagggc • aggaggggac 28140 
agtggcctgg gcccctgcgg gggagctttc 28200 
ccctgccttt acacagaatt tgtgggggat 28260 
gtgagtgcat ctgaattttt tttttttttt 28320 
ttactctatc agcacgtgca gaccttttcc 28380 
gaaagtgaag aaaaaccacc ggaaatccca 28440 
cgatcacgaa atgagcgctt gtttttgaag 28500 
caacctttct gactttattt ttcccacgaa 28560 
accgaaaaaa caaaaaaccc agcaagtgtt 28620 
cactggatcc tcccggcagc cgatgaggct 28680 

28690 



<210> 2 
<211> 2476 
<212> DNA 
<213>'Hoino sapiens 



<400> 2 

atggggtggc tttgctctgg gctcctgttc 
gcaagctctg ggaacatgaa ggtcttgcag 
atctcta.ctt gcgagtggaa gatgaatggt 
ttgtacpagc tggtttttct gctctccgaa 
ggcgcggggt gcgtgtgcca cctgctcatg 
ctggacctgt gggctgggca gcagctgctg 
gtgaaaccca gggccccagg aaacctgaca 
ctgacctgga gcaacccgta tccccctgac 
gtcaacattt ggagtgaaaa cgacccggca 
gaaccctccc tccgcatcgc agccagcacc 



cctgtgagct gcctggtcct gctgcaggtg 60 
gagcccacct gcgtctccga ctacatgagc 120 
cccaccaatt gcagcaccga gctccgcctg 180 
gcccacacgt gtatccctga gaacaacgga 240 
gatgacgtgg tcagtgcgga taactataca 300 
tggaagggct ' ccttcaagcc cagcgagcat 360 
gttcacacca atgtctccga cactctgctg 420 
aattacctgt ataatcatct cacctatgca 480 
gatttcagaa tctataacgt gacctaccta 54 0 
ctgaagtctg ggatttccta cagggcacgg 600 
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gtgagggcct gggctcagtg ctataacacc 
tggcacaact cctacaggga gcccttcgag 
tgcattgtca tcctggccgt ctgcctgttg 
gaatggtggg atcagattcc caacccagcc 
gatgctcagg ggtcacagtg ggagaagcgg 
cactggaaga attgtcttac caagctcttg 
gatgaagatc ctcacaaggc tgccaaagag 
tggtgcccag tggagatcag caagacagtc 
tgtgtggagt tgtttgaggc cccggtggag 
aaagggagct tctgtgcatc gcctgagagc 
ggcattgtgg cccggctaac agagagcctg 
ggcttttgcc agcaggacat gggggagtca 
gctcacatgc cctgggatga gttcccaagt 
aaggagcagc ctctccacct ggagccaagt 
aacctgactt gcacagagac gcccctcgtc 
agcaactccc tgagccagtc accgtgtccc 
agacacctgg aggaagtaga acccgagatg 
actgtgcccc aacctgagcc agaaacctgg 
catggggcag ctgcagcccc cgtctcggcc 
gcggtggagc agggtggcac ccaggccagt 
gctggttaca aggccttctc aagcctgctt 
gggtttgggg ctagcagtgg ggaagagggg 
tgccctgggg accctgcccc agtccctgtc 
ccacctcgca gtccgcagag ctcacatctc 
gagccggggg aaaaggtaga ggacatgcca 
gacccccttg tggacagcct gggcagtggc 
tgcggccacc tgaaacagtg tcatggccag 
agtccttgct gtggctgctg ctgtggagac 
gccccagacc cctctccagg tggggttcca 
gcaccctcgg gcatctcaga gaagagtaaa 
aatgctcaga gctcaagcca gacccccaaa 
tacatgaggg tctctt 



acctggagtg agtggagccc cagcaccaag 660 
cagcacctcc tgctgggcgt cagcgtttcc 720 
tgctatgtca gcatcaccaa gattaagaaa 780 
cgcagccgcc tcgtggctat aataatccag 840 
tcccgaggcc aggaaccagc caagtgccca 900 
ccctgttttc tggagcacaa catgaaaagg 960 
atgcctttcc agggctctgg aaaatcagca 1020 
ctctggccag agagcatcag cgtggtgcga 1080 
tgtgaggagg aggaggaggt agaggaagaa 114 0 
agcagggatg acttccagga gggaagggag 1200 
ttcctggacc tgctcggaga ggagaatggg 1260 
tgccttcttc caccttcggg aagtacgagt 1320 
gcagggccca aggaggcacc tccctggggc 1380 
cctcctgcca gcccgaccca gagtccagac 1440 
atcgcaggca accctgctta ccgcagcttc 1500 
agagagctgg gtccagaccc actgctggcc 1560 
ccctgtgtcc cccagctctc tgagcca'acc 1620 
gagcagatcc tccgccgaaa tgtcctccag 1680 
cccaccagtg gctatcagga gtttgtacat 1740 
gcggtggtgg gcttgggtcc cccaggagag 1800 
gccagcagtg ctgtgtcccc agagaaatgt 1860 
tataagcctt tccaagacct cattcctggc 1920 
cccttgttca cctttggact ggacagggag 1980 
ccaagcagct ccccagagca cctgggtctg 2040 
aagcccccac ttccccagga gcaggccaca 2100 
attgtctact cagcccttac ctgccacctg 2160 
gaggatggtg gccagacccc tgtcatggcc 2220 
aggtcctcgc cccctacaac ceccctgagg 2280 
ctggaggcca gtctgtgtcc ggcctccctg 2340 
tcctcatcat ccttccatcc tgcccctggc 2400 
atcgtgaact ttgtctccgt gggacccaca 2460 

2476 



<210> 3 
<2il> 825 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Met Gly Trp Leu Cys Ser Gly Leu Leu Phe Pro Val Ser Cys Leu Val 
15 10 15 

Leu Leu Gib Val Ala Ser Ser Gly Asn Met Lys Val Leu Gin Glu Pro 
20 ' 25 30 

Thr Cys Val Ser Asp Tyr Met Ser He Ser Thr Cys Glu Trp Lys Met 
35 40 45 
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Asn Gly Pro Thr Asn Cys Ser Thr Glu Leu Arg Leu Leu Tyr Gin Leu 
50 55 ' 60 

Val Phe Leu Leu Ser Glu Ala His Thr Cys He Pro Glu Asn Asn Gly 
65 70 * 75 80 

Gly Ala Gly Cys Val Cys His Leu Leu Met Asp Asp Val Val Ser Ala 
85 90 95 

Asp Asn Tyr Thr Leu Asp Leu Trp Ala Gly Gin Gin Leu Leu Trp-Lys 
100 • 105 110 

Gly Ser Phe Lys Pro Ser Glu His Val Lys Pro Arg Ala Pro Gly Asn 
115 120 125 

Leu Thr Val His Thr Asn Val Ser Asp Thr Leu Leu Leu Thr Trp Ser 
130 135 140 

Asn Pro Tyr Pro Pro Asp Asn Tyr Leu Tyr Asn His Leu Thr Tyr Ala 
145 150 155 160 

Val Asn He Trp Ser Glu Asn Asp Pro Ala Asp Phe Arg He Tyr Asn 
165 170 175 

Val Thr Tyr Leu Glu Pro Ser Leu Arg He Ala Ala Ser Thr Leu Lys 
180 185 190 

Ser Gly He Ser Tyr Arg Ala Arg Val Arg Ala Trp Ala Gin Cys Tyr 
195 200 205 

Asn Thr Thr Trp Ser Glu Trp Ser Pro Ser Thr Lys Trp His Asn Ser 
210 215 220 

Tyr Arg Glu Pro Phe Glu Gin His Leu Leu Leu Gly Val Ser Val Ser 
225 230 . 235 240 

Cys He Val He Leu Ala Val Cys Leu Leu Cys Tyr Val Ser He Thr 
245 250 255 

Lys He Lys* Lys Glu Trp Trp Asp Gin He Pro Asn Piro Ala Arg Ser 
260 265 270 

Arg Leu Val Ala He He He Gin Asp Ala Gin Gly Ser Gin Trp Glu 
275 280 285 

Lys Arg Ser Arg Gly Gin Glu Pro Ala Lys Cys Pro His Trp Lys Asn 
290 295 300 
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Cys Leu Thr Lys Leu Leu Pro Cys Phe Leu Glu His Asn Met Lys Arg 
305 310 315 320 

Asp Glu Asp Pro His Lys Ala Ala Lys Glu Met Pro Phe Gin Gly Ser 
325 330 335 

Gly Lys Ser Ala Trp Cys Pro Val Glu lie Ser Lys Thr Val Leu Trp 
340 345 350 

Pro Glu Ser lie Ser Val Val Arg Cys Val Glu Leu Phe Glu Ala Pro 
355 360 365 

Val Glu Cys Glu Glu Glu Glu Glu Val Glu Glu Glu Lys Gly Ser Phe 
370 375 380 

Cys Ala Ser Pro Glu Ser Ser Arg Asp Asp Phe Gin Glu Gly Arg Glu 
385 390 395 4O0 

Gly He Val 7U.a Arg Leu Thr Glu Ser Leu Phe Leu Asp Leu Leu Gly 
405 410 415 

Glu Glu Asn Gly Gly Phe Cys Gin Gin Asp Met Gly Glu Ser Cys Leu 
420 425 430 

Leu Pro Pro Ser Gly Ser Thr Ser Ala His Met Pro Trp Asp Glu Phe 
435 440 445 

Pro Ser -Ala Gly Pro Lys Glu Ala Pro Pro Trp Gly Lys Glu Gin Pro 
450 455 460 . • 

Leu His Leu Glu Pro Ser Pro Pro Ala Ser Pro Thr Gin Ser Pro Asp 
465 470 475 480 

Asn Leu Thr Cys Thr Glu Thr Pro Leu Val lie Ala Gly Asn Pro Ala 
485 490 495 

Tyr Arg Ser Phe Ser Asn Ser Leu Ser Gin Ser Pro Cys Pro Arg Glu 
500' '505 510 

Leu Gly Pro Asp Pro Leu Leu Ala Arg His Leu Glu Glu Val Glu Pro 
515 520 525 

Glu Met Pro Cys Val Pro Gin Leu Ser Glu Pro Thr Thr Val Pro Gin 
530 535 540 



Pro Glu Pro Glu Thr Trp Glu Gin He .Leu Arg Arg Asn Val Leu Gin 
545 550 555 560 
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His Gly Ala Ala Ala Ala Pro Val Ser Ala Pro Thr Ser Gly Tyr Gin 
565 570 575 

Glu Phe Val His Ala Val Glu Gin Gly Gly Thr Gin Ala Ser Ala Val 
580 585 590 

Val Gly Leu Gly Pro Pro Gly Glu Ala Gly Tyr Lys Ala Phe Ser Ser 
595 600 605- 

Leu Leu Ala Ser Ser Ala Val Ser Pro Glu Lys Cys Gly Phe Gly Ala 
610 615 620 

Ser Ser Gly Glu Glu Gly Tyr Lys Pro Phe Gin Asp Leu lie Pro Gly 
625 630 635 640 

Cys Pro Gly Asp Pro Ala Pro Val Pro Val Pro Leu Phe Thr Phe Gly 
645 650 655 

Leu Asp Arg Glu Pro Pro Arg Ser Pro Gin Ser Ser His Leu Pro Ser 
660 665 670 

Ser Ser Pro Glu His Leu Gly Leu Glu Pro Gly Glu Lys Val Glu Asp 
675 -680 685 

Met Pro Lys Pro Pro Leu Pro Gin Glu Gin Ala Thr Asp Pro Leu Val 
690 695 700 

Asp Ser Leu Gly Ser Gly lie Val Tyr Ser Ala Leu Thr Cys His Leu 
705 710 715 720 

Cys Gly His Leu Lys Gin Cys His Gly Gin Glu Asp Gly Gly Gin Thr 
725 730 735 

Pro Val Met Ala Ser Pro Cys Cys Gly Cys Cys Cys Gly Asp Arg Ser 
740 745 750 

Ser Pro Pro Thr Thr Pro Leu Arg Ala Pro Asp Pro Ser Pro Gly Gly 
755 760 765 

Val Pro Leu Glu Ala Ser Leu Cys Pro Ala Ser Leu Ala Pro Ser Gly 
770 775 780 

lie Ser Glu Lys Ser Lys Ser Ser Ser Ser Phe His Pro Ala Pro Gly 
785 790 795 800 



Asn Ala Gin Ser Ser Ser Gin Thr Pro Lys lie Val Asn Phe Val Ser 
805 810 815 
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Val Gly Pro Thr Tyr Met Arg Val Ser 
820 825 



<210> 4 

<211> 15 

<212> DNA 

<'213> Homo sapiens 

<400> 4 

ttgcaccact gcact 15 

<210> 5 
<211> 15 
<212> DNA 

<213> Homo sapiens * 



<210> 6 

<211> 15 

•<212> DNA 

<213> Homo sapiens 

<400> 6 • 

ttttgtgcta ttccc. 15 

<210> 7 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 5 



ttgcaccgct gcact 



15 



<400> 7 



ttttgtgtta ttccc 



15 



<210>- 8 



<211> 15 



<212> DNA 



<213> Homo sapiens 



<400> 8 



ctgggccgct caggc 



15 
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<210> 9 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ctgggcctct caggc 15 



<210> 10 
<211> 15 
<212> DNA 

<213> Homo sapiens . - , 
<400> 10 

taagcctgcg ctgga 15 



<2ip> 11 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 11 

taagcctccg ctgga 15* 

<210> 12 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 12 

agaacaacgg aggcg " .15 



<210> 13 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 13 

agaacaatgg aggcg . 15 



<210> 14 
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<211> 15 



<212> DNA 



<213> Homo sapiens 



<400> 14 



cggaggcgcg gggtg. 



15 



<210> 15 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 15 

cggaggcacg gggtg 15 

<210> 16 
<211> 15 
<212> DNA 

<213> Homo sapiens 



<210> 17 

<211> 15 ■ 

<212> DNA 

<213> Homo sapiens 

<40P> 17 ' 

gtgcggacaa ctata 15 

<210> 18 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 16 



gtgcggataa cta-ta 



15 



<400> 18 



cggagtgcgg caggg 



15 



<210> 19 



<211> 15 



<212> DNA 



<213> Homo sapiens 
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<400> 19 



cggagtgtgg caggg 



15 



<210> 20 



<211> 15 • 

<212> DNA 

<213> Homo sapiens 



<400> 20 



gcctgggctg agggt 



15 



<210> 21 

<211>. 15 

<212> DNA 

<213> Homo sapiens 

<400> 21 

gcctgggttg agggt 15 

<210> 22 ■ 
<211> 15 
<212> DNA 

<213> Homo sapiens. 



<210> 23 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 23 

tggggtgagc agggg . 15 



<400> 22 



tggggtgggc agggg 



15 



<210> 24 



<211> 15 



<212> DNA 



<213> Homo sapiens 



<400> 24 



ttctcccgca gtgaa 



15 
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<210> 25 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 25 

ttctcccaca gtgaa 15 



<210> 26 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 26 

•gtgaaaacga cccgg 15 



<210> 27 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 27 * 

gtgaaaatga cccgg 15 



<210> 28 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 28 

ggcaagccct ggggc 15 



<210> 29 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 29 

ggcaagctct ggggc 15 



<210> 30 
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<211> 15 

<212> DNA 

<213> Homo sapiens 



<400> 30 



gccctggggc tggat 



15 



<210> 31 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 31 

gccctggagc tggat 15 

<210> 32 

<211> 15 . 

<212> DNA 

<213> Homo sapiens 



<210> 33 

<211> 15 

<212> DNA 

<213> Honio sapiens 

<400> 33. . 

atagcaattc ccagg 15 



<210> 34 

<211> 15 * 

<212> DNA 

<213> Homo sapiens 



<400> 32 



atagcaaatc ccagg 



15 



<400> 34 



gctctgccct aggca 



15 



<210> 35 
<211> 15 
<212> DNA 



<213> Homo sapiens 
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<4 00> 35. 

gctctgcact aggca 15 



<210> 36 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 36 

cccccacccc' tcaca 15 



<210> 37 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 37 

cccccactcc tcaca 15 



<210> 38 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 38 

tccctccgca tcgca 15 



<210> 39 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 39 

tccctccaca tcgca 15 



<210> 40 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 40 

cacctgctgt ggtgt 15 
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<210> 41 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 41 

cacctgccgt ggtgt 15 



<210> 42 * 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 42 

atgtctgaag tagac 15 



<210> 43 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 43 

atgtctgcag tagac 15 



<210> 44 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 44 

tgaccaacct ttgct. 15 

<210> 45 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 45 

tgaccaactt ttgct 15 



<210> 46 
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<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 46 

cctgttttct ggagc 15 



<210> 47 

<211> 15 

<212> DNA • 

<213> Homo sapiens 

<400> 47 

cctgtttcct ggagc 15 



<210> 48 

<211> 15 

<212> DNA 

<213> -Homo sapiens 

<400> 48 

tggacctgct cggag 15 



<210> 49 

<211>- 15 

<212> DNA 

<213> Homo sapiens 

<400> 49 

tggaccttct cggag 15 



<210> 50 

<211> 15 

<212> DNA 

<213> Homo sapiens 

'■<400> 50 

agtcatgcct tcttc 15 



<210> 51 

<211> 15 

<212> DNA 

<213> Homo sapiens 



24 



wo 01/04270 

<400> 51 

agtcatgtct tcttc 

<210> 52 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 52 

gccttcttcc acctt 

<210> 53 

<211> 15 

.<212> DNA 

<213> Homo sapiens 

<400> 53 

gccttctccc acctt 

<210> 54 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 54 

cagcccccgt ctcgg 

<210> 55 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 55 

cagcccctgt ctcgg 

<210> 56 
<211> 15 

<212> DNA 

<213> Homo sapiens 
<400> 56 

ggagtttgta catgc 



PCT/USOO/19094 



15 



15 



15 



15 



15 
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<210> 57 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 57 

ggagtttata catgc 15 



<210> 58 

<211> 15 

<2:12> DNA 

<213> Homo sapiens 

<400> 58 

cagctcccca gagca 15 



<210> 59 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 59 

cagctcctca gagca 15 



<210> 60 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 60 

agacaggtcc tcgcc 15 



<210> 61 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 61 

agacagggcc tcgcc 15 



<210> 62 
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<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 62 

ctgcccctgg caatg 15 



<210> 63 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 63 

ctgcccccgg caatg 15 



<210> 64 

<211> 15 

<2.12> DNA 

<213> Homo sapiens 

<400> 64 

aggtgcatgt octet 15 



<210> 65 
<211> 15 
<212> DNA 
<213> Homo 



sapiens 



<400> 65 

aggtgcacgt octet 



<210> 66 
<211> 15 

<212> DNA 

<213> Homo sapiens 

<40.0> 66 

gtgcatgtcc tcttg 15 



<210> 67 

<211> 15 

<212> DNA 

<213> Homo sapiens 
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<400> 67 

gtgcatgccc tcttg 15 



<210> 68 

<211> 15 

<:212> DNA 

<213> Homo sapiens 

<400> 68- 

ggcttatcca -tgcct 15 



<210> 69 

<211>'l5 

<212> DNA 

<213> Homo sapiens 

<400> 69 

ggcttattca tgcct 15 



<210> 70 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 70 

agccaggctg gcaga 15 



<210> 71 

<211>" 15 

<212> DNA 

<213> Homo sapiens 

<400> 71 

agccagggtg gcaga 15 



<210> 72 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 72 

ggcccacatg gaggc 15 
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<210> 73 

<211> 15 . 

<212> DNA 

<213> Homo sapiens 

<400> 73 ' * 

ggcccacgtg gaggc 15 



<210> 74 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 74 

taacacagcc atcaa 15 



<210> 75 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 75 

taacacaacc atcaa . . 15 



<210> 76 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 76 

taatgctcgt ctgtg 15 



<210> 77 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 77 

taatgcttgt ctgtg 15 



<210> 78 
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<211> 15 



<212> DNA 

<213> Homo sapiens 



<400> 78 



acttgccgtc tgggt 



15* 



<210> 79 

<211> 15 

<212> DNA - 

<213> Homo sapiens 

<400> 79 

acttgccatc tgggt 15 

<210> 80 

<211> 15 

<212> DNA 

<213> Homo sapiens 



<210> 81 

<211> 15 

<212> DNA 

<213> Homo sapiens 

i:400> 81 

ggctggagtg cagtg 15 

<210> 82 

<211> 15 

<212>- DNA 

<213> Homo sapiens 



<400> 80 



ctgagattgc 



accac 



15 



<400> 82 



ctgagattgc 



accgc 



15 



<210> 83 
<211> 15 
<212> DNA 



<213> Homo 



sapiens 
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<400> 83 

ggctggagtg cagcg 15 



<210> 84 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 84 

ctgtgctttt gtgct 15 



<210> 85 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 85 

accaagggga atagc ' 15 



<210> 86 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 86 

ctgtgctttt gtgtt 15 



<210> 87 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 87 

accaagggga ataac 15 



<210> 88 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 88 

gagttcctgg gccgc 15 



31 



wo 01/04270 



PCTAJSOO/19094 



<210> 89 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 89 

ggagcagcct gagcg 15 



<210> 90 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 90 

gagttcctgg gcctc . ' 15 



<210> 91 ' 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 91 

ggagcagcct gagag 15 



<210> 92 

<211> 15 

<212> DNA * . . 

<213> Homo sapiens 

<400> 92 

tccgagtaag cctgc 15 



<210> 93 

<211> 15- 
*<212> DNA 
<213> Homo sapiens 

<400> 93 

tccagctcca gcgca 15 



<210> 94 
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<211> 15 

<212> DNA 

<213> Homo sapiens 

.<400> 94 

tccgagtaag cctcc 15 



<210> 95 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 95 

tccagctcca gcgga 15 



<2ia> 96 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 96 

tccctgagaa caacg 15 



<210> 97 

<211> 15 . 

<212> DNA 

<213> Homo sapiens 

<400> 97 

accccgcgcc tccgt 15 



<210> 98 
<211> 15 
<212> DNA 

,<213> Homo sapiens . - • 

<400> 98 

tccctgagaa caatg 15 



<210> 99 

<211> 15 

<212> DNA 

<213> Homo sapiens 
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<400> 99 

accccgcgcc tccat 15 



<210> 100 
<211> 15 
<212> DNA 

<213> Homo sapiens . 
<400> 100 

gaacaacgga ggcgc 15 



<210> 101 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 101 

cacacgdacc ccgcg 15 



<210> 102 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 102 

gaacaacgga ggcac 15 



<210> 103 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 103 

cacacgcacc ccgtg 15 



<210> 104 

<211> 15* 

<212> DNA 

<213> Homo sapiens 

<400> 104 

tggtcagtgc ggata 15 
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<210> 105. 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 105 

ccagtgtata gttat 15 



<210> 106* 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 106 

gcagggcgga gtgcg 15 



<210> 107 * 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 107 

agccacccct gccgc * 15 



<210> 108 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 108 

gcagggcgga gtgtg 15 



<210> 109 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 109 

agccacccct gccac 15 



<210> 110 
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<211> 15 
<212> DNA 

<213> Homo' sapiens ■ 
<400> 110 

acagctgcct gggct 15 

<210> 111 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 111 

caccccaccc tcagc 15 



<210> 112 • ■ 

<211> 15 . 

<212> DNA 

<213> Homo sapiens 

<400> 112 

acagctgcct gggtt 15 



<210> 113 

<211> 15 ■ 

<212> DNA 

<213> Homo sapiens 

<400> 113 

acagctgcct gggtt 15 



<210> 114 

<211> 15 ' . . 

<212> DNA 

<213> Homo sapiens 

<40b> 114 

tgagggtggg gtggg 15 



<210> 115 

<211> 15 

<212> DNA 

<213> Homo sapiens 
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<400> 115 

cctcctcccc tgccc 15 



<210> 116 

<211>.15 

<212> DNA 

<213> Homo sapiens 

<400> 116 

tgagggtggg gtgag 15 



<210> 117 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 117 

cctcctcccc tgctc 15 



<210> 118 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 118 

ggccgcttct cccgc 15 



<210> 119 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 119 

ctgggtttca ctgcg 15 



<210> 120 

<211> .15 . 

<212> DNA 

<213> Homo sapiens 

<400> 120 

ggccgcttct cccac 15 
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<2iq> 121 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 121 

• ctgggtttca ctgtg 15 



<210> 122 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 122 

tttggagtga aaacg 15 



<210> 123 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> -123 

catctgccgg gtcgt .15 



<210> 124 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 124 

tttggagtga aaatg 1-5 



<210> 125 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 125 

catctgccgg gtcat 15 



<210> 126 



38 



wo 01/04270 



PCT/USOO/19094 



<211> 15 



<212> DNA 
<213> Homo 



sapiens 



<400> 126 



ctgggaggca 



agccc 



15 



<210> 127 

<21X> 15 

<212> DNA 

<213> Homo sapiens 

<400> 127 

tatccagccc caggg 15' 



<210> 128 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 128 

tggtcagtgc ggaca 15 



<210> 129 
<211> 15 

<212> DNA - . 

<213> Homo sapiens 

<400> 129 

ccagtgtata gttgt 15 



<210> 130. 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<4D0> 130 

ctgggaggca agctc 15 



<210> 131 

<211> 15 

<212> DNA 

<213> Homo sapiens 
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<400> 131 

tatccagccc cagag 15 



<210> 132 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 132 

aggcaagccc tgggg 15 



<210> 133 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 133 

tttgctatcc agccc 15 



<210> 134 

'<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 134 

aggcaagccc tggag 15 



<210> 135 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 135 

tttgctatcc agctc 15 



<210> 136 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 136 

ggctggatag caaat 15 
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<210> 137 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 137 

ctagctcctg ggatt 15 



<210> 138 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 138 

ggctggatag caatt 15 



<210> 139 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 139 

ctagctcctg ggaat 15 



<210> 140' 

<211> 15 

<212>/DNA 

<213> Homo sapiens 

<400> 140 

cacctggctc tgccc 15 



<210> 141 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<4G0> 141 

gggacttgcc taggg 15 



<210> 142 
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<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 142 
cacctggctc tgcac 

<210> 143 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 143 
gggacttgcc tagtg 

<210> 144 
<211> 15 ' 
<212> DNA 
<213> Homo sapiens 

<400> 144 
cctggccccc caccc 

<210> '145 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 145 
ctctgatgtg agggg • 

<210> 146 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 146 
cctggccccc cactc 

<210> 147 

<211> 15 

<212> DNA 

<213> Homo sapiens 



PCT/USOO/19094 



15 



15 



15 



15 
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<400> 147 ■ 

ctctgatgtg aggag 15 



<210> 148 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 148 

gaaccctccc tccgc 15 

<210> 149 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 149 

gctggctgcg atgcg 15 



<210> 150 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 150 

gaaccctccc tccac 15 



<210> 151 

<211> 15 • 

<212> DNA 

<213> Homo sapiens 

<400> 151 

gctggctgcg atgtg 15 



<210> 152 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 152 

tagatacacc tgctg 15 
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<210> 153 
<211> 15 
<212> DNA 

<213> HomQ * sapiens 
<400> 153 

gcagatacac cacag 15 



<210> 154 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 154 

tagatacacc tgccg 15 



<210> 155 
<211> 15 
<212> DNA 

<2'13> Homo sapiens 
<400> 155 

gcagatacac cacgg 15 



<210> 156 
<211> 15 

<212> DNA . . ' . 

<213> Homo sapiens 

<400> 156 

gaaggcatgt ctgaa 15 

<210> 157 

<2il> 15 • 

<212> DNA * . 
<213> Homo sapiens 

<400> 157 

atggctgtct act to 15 



<210> 158 
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<211> 15 

<212> *DNA 

<213> Homo sapiens 

<400> 158 

gaaggcatgt ctgca 15 



<210> 159 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 159 

atggctgtct actgc 15 



<210> 160 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 160 

gaaccctgac caacc 15 



<210> i61 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400.> 161 . 

tgcaaaagca aagga 15 



<210> 162 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 162 

gaaccctgac caatc 15 



<210> 163 

<211> 15 

<212> DNA 

<213> Homo sapiens 
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<400> 163 

tgcaaaagca aagaa 15 



<210> 164 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 164 

tcttgccctg ttttc 15 



<210> 165 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 165 

tgttgtgctc cagaa 15 



<210> 166 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 166 

tcttgccctg tttcc 15 



<210> 167 

<211> 15 

<212> DNA 

<213>* Homo sapiens 

<400> 167 

tgttgtgctc cagga 15 



<210> 168 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 168 

tgttcctgga cctgc 15 
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<210> 169 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 169 

tctcctctcc gagca 15 



<210> 170 

<211> 15 

<212> DNA 

<213> Homo sapiens 

-<400>170 

tgttcctgga ccttc 15 



<210> 171 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 171 

tctcctctcc gagaa 15 



<210> 172 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 172 

tgggggagtc atgcc 15 



<210> 173 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 173 

aaggtggaag aaggc 15 



<210> 174 
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<211> 15 



<212> DNA 



<213> Homo 



sapiens 



<400> 174 



tgggggagtc 



atgtc 



15 



<210> 175 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 175 

aaggtggaag aagac 15 

<210> 176 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 176 . 

agtcatgcct tcttc 15 

■<210> 177 
<211> 15 ■ 
<212> DNA 
<213> Homo sapiens 

<400> 177 

ttcccgaagg tggaa 15 

<210> 178 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 178 

agtcatgcct tctcc 15 

<210> 179 

<211> 15 

<212> DNA 

<213> Homo sapiens 
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<400> 179 



ttcccgaagg tggga 



15 



<210> 180 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 180 

cagctgcagc ccccg 15 

<210> 181 
<211> 15 . 



<210> 182 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 182 

cagctgcagc ccctg . 15 
<210> 183 



<212> DNA 

<213> Homo sapiens 



<400> 181 



tgggggccga gacgg 



15 



<211> 15 



<212> DNA 

<213> Homo sapiens 



<4aO> 183 



tgggggccga gacag 



15 



<210> 184 



<211> 15 



<212> DNA 



<213> Homo sapiens 



<400> 184 
ctatcaggag tttgt 



15 
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<210> 185 

<211> 15 

<212> DNA' 

<213> Homo sapiens 

<400> 185 

tccaccgcat gtaca 15 

<210> 186 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 186 

ctatcaggag tttat 15 



<210> 187 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 187 

tccaccgcat gtata 15 



<210> 188 . 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 188 

cccaagcagc tcccc 15 



<210> 189 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 189 

cccaggtgct ctggg 15 



<210> 190 
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<211> 15 
<212> DNA 



<213> Homo 



sapiens 



<400> 190 



cccaagcagc 



tcctc 



15 



<210> 191 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 191 

cccaggtgct ctgag 15 

<210> 192 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 192 

ctgtggagac aggtc .15 

<210> 193 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 193 

gtagggggcg aggac 15 

<210> 194 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 194 

ctgtggagac agggc 15 

<210> 195 

<211> 15 

<212> DNA 

<213> Homo sapiens 
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<400> 195 

gtagggggcg aggcc 15 



<210> 196 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 196 

tccatcctgc ccctg 15 



<210> 197 

<211> 15 • • 

<212> DNA 

<213> Homo sapiens 

<400> 197 . • 

tctgagcatt gccag 15 



<210> 198 

<211-> 15 . 

<212> DNA 

<213> Homo sapiens 

<400> 198 

tccatcctgc ccccg 15 



<210> 199 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 199 

tctgagcatt gccgg • 15 



<210> 200 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 200 

tctcttaggt gcatg 15 



52 



wo 01/04270 



PCTAJSOO/19094 



<210> 201 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 201 

gcaacaagag gacat 15 



<210> 202 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 202 

tctcttaggt gcacg 15 



<210> 203 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 203 

gcaacaagag gacgt 15 



<210> 204 

<211> 15 

<212> DNA . . 

<213> Homo sapiens 

<40b> 204 

tcttaggtgc atgtc . • 15 



<210> 205 
♦<211> 15 
<212> DNA 
<213> Homo sapiens 

<400> 205 

cagcaacaag aggac 15 



<210> 206 
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<211> 15 



<212> DNA 



<213> Homo 



sapiens 



<400> 206 



tcttaggtgc 



atgcc 



15 



<210> 207 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 207 

cagcaacaag agggc 15 

<210> 208 
<211> 15 
<212> DNA 

<213> Homo sapiens 
<400> 208 

gactagggct tatcc . . 15 

<210> 209 
<211> 15 
<212> DNA 

<213> Homo sapiens 

<400> 209 . 
tttcccaggc atgga 15 

<210> 210 

<211> 15 . . 

<212'> DNA 

<213> Homo sapiens 

<400> 210 

gactagggct tattc 15 

<210> 211 ' - 

<211> 15 
<212> DNA 

<213> Homo sapiens 
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<400> 211 

tttcccaggc atgaa 15 



<210> 212 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 212 

gaaggcagcc aggct 15 



<210> 213 . 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 213 

tggaaatctg ccagc 15 



<210> 214 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 214 

gaaggcagcc agggt 15 



<210> 215 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 215 

tggaaatctg ccacc 15 



<210> 216 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 216 

gatcatggcc cacat 15 
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<210> 217 

<211> 15 ' . 

<212> DNA 

<213> Homo sapiens 

<4.00> 217 

aggtgggcct ccatg 15 



<210> 218 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 218 

gatcatggcc cacgt 15 



<210> 219 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 219 

aggtgggcct ccacg 15 



<210> 220 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 220 

agaaactaac acagc 15 



<210> 221 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 221 

attcccttga tggct 15 



<210> 222 
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<211> 15 



<212> DNA 
<213> Homo 



sapiens 



<400> 222 



. agaaactaac 



acaac 



15 



<210> 223 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 223 

attcccttga tggtt 15 



<210> 224 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 224 

gttgagtaat gctcg 15 

<210> 225 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 225 

aaaacacaca gacga ,15 

<210> 226 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 226 

gttgagtaat gcttg 15 ■ 

<210> 227 

<211>*15 

<212> DNA 

<213> Homo sapiens 
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<400> 227 



aaaacacaca gacaa 



15 



<210> 228 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 228 

taagaaactt gccgt 15 

<210> 229 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 229 

acccaaaccc agacg 15 

<210> 230 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 230 

taagaaactt gccat 15 

<210> 231 . 

<211> 15 * 

<212> DNA 

<213> Homo sapiens 

<400> 231 

acccaaaccc agatg 15 

<210> 232 

<211> 10 

<212> DNA 

<213> Homo sapiens 



<400> 232 



agattgcacc 



10 
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<210> 233 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 233 

tggagtgcag 10 



<210> 234 
<211> 10 . 
<212> DNA 
<213> Homo sapiens 

<400> 234 

tgcttttgtg 10 



<210> 235 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 235 

aaggggaata 10 



<210> 236 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 236 

ttcctgggcc 10 



<210> 237 
<211> 10 

<:212> DNA 

<213> Homo sapiens 

<400> 237 

gcagcctgag 10 



<210> 238 



59 



wo 01/04270 PCT/USOO/19094 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 238 

gagtaagcct 10 



<210> 239 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 239 

agctccagcg 10 



<210> 240 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 240 

ctgagaacaa 10 



<210> 241 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 241 

ccgcgcctcc 10 



<210> 242 • 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 242 

caacggaggc 10 



<210> 243 
<211> 10 
<212> DNA 

<213> Homo sapiens 
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<400> 243 
acgcaccccg 

<210> 244 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 244 
tcagtgcgga 

<210> 245 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 245 
gtgtatagtt 

<210> 246 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 246 
gggcggagtg 

<210> 247 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 247 
cacccctgcc 

<210> 248 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 248 
gctgcctggg 



10 



10 



10 



10 



10 



10 
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<210> 249 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 249 

cccaccctca 10 



<210> 250 
<211> 10 

<212> DNA 

<213> Homo sapiens • 
<400> 250 

gggtggggtg 10. 



<210> 251 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 251 

cctcccctgc 10 



<210> 252 

<211> 10 • 

<212> DNA 

<213> Homo sapiens 

<400> 252 

cgcttctccc 10 



<210> 253 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 253 

ggtttcactg 10 



<210> 254 
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<211> 10 



<212> DNA 



<213> Homo 



sapiens 



<400> 254 



ggagtgaaaa 



10 



<210> 255 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 255 

ctgccgggtc 10 

<210> 256 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 256 

ggaggcaagc 10 

<210> 257 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 257 

ccagccccag 10 

<210> 258 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 258 

caagccctgg 10 

<210> 259 

<211> 10 , 

<212> DNA 

<213> Homo sapiens 
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<400> 259 

gctatccagc 10 



<210> 260 

<211> 10 

<2l2> DNA 

<213> Homo sapiens 

<400> 260 

tggatagcaa 10 



<210> 261 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 261 

gctcctggga 10 



<210> 262 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 262 

ctggctctgc 10 



<210> 263 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 263 

acttgcctag 10 



<210> 264 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 264 

ggccccccac 10 
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<210> 265 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 265 

tgatgtgagg 10 



<210> 266 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 266 

ccctccctcc * 10 



<210> 267 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 267 

ggctgcgatg 10 



<210> 268 

<211> 10 . ' . 

<212> DNA 

<213> Homo sapiens 
<400> 268 

atacacctgc 10 



<210> 269 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 269 

gatacaccac 10 



<210> 270 
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<211> 10 
<212> DNA 
<213> Homo 



sapiens 



<400> 270 



ggcatgtctg 



10 



<210> 271 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 271 

gctgtctact 10 

<210> 272 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<4 0P> 272 

ccctgaccaa 10 

<210> 273 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 273 

aaaagcaaag 10 

<210> 274 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 274 

tgccctgttt 10 

<210> 275 

<211> 10 

<212> DNA 

<213> Homo sapiens 
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<400> 275 



tgtgctccag 



10 



<210> 276 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 276 

tcctggacct 10 

<210> 277 
<211> 10 
<212> DNA 

<213> Homo sapiens 
<400> 277 

cctctccgag . 10 

<210> 278 
<211> 10 
■<212> DNA 
<213> Homo sapiens 

<400> 278 

gggagtcatg 10 
<210> 279 



<210> 280 
<211>'l0 
<212> DNA 

<213> Homo sapiens 
<400> 280 

catgccttct 10 



<211> 10 
<212> DNA 

<213> Homo sapiens 



<400> 279 
gtggaagaag 



10 
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<210> 281 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 281 

ccgaaggtgg 10 



<210> 282 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 282 

ctgcagcccc 10 



<210> 283 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 283 

gggccgagac 10 



<210> 284 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 284 

tcaggagttt 10 



<210> 285 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 285 

accgcatgta 10 



<210> 286 
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<211> 10 



<212> DNA 
<213> Homo 



sapiens 



<400> 286 



aagcagctcc 



10 



<210> 287 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 287 

aggtgctctg 10 

<210> 288 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 288 

tggagacagg 10 

<210> 289 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 289 

gggggcgagg 10 

<210> 290 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 290 

atcctgcccc 10 

<210> 291 

<211> 10 

<212> DNA 

<213> Homo sapiens 
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<400> 291 

gagcattgcc 10 



<210> 292 

<211> 10 

<212> DMA 

<213> Homo sapiens 

<400> 292 

cttaggtgca 10 



<210> 293 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 293 

acaagaggac 10 



<210> 294 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 294 

taggtgcatg 10 



<210> 295 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 295 

caacaagagg 10 



<210> 296 . 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 296 

tagggcttat 10 
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<210> 297 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 297 

cccaggcatg Xq 



<210> 298 
<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 298 

ggcagccagg 10 



<210> 299 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 299. 

aaatctgcca 10 



<210> 300 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 300 

catggcccac 10 



<210> 301 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 301 

tgggcctcca • 10 



<210> 302 
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<211> 10 



<212> DNA 
<213> Homo 



sapiens 



<400> 302 



aactaacaca 



10 



<210> 303 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<4p0> 303 

cccttgatgg • 10 
<210> 304 

<211> 10 . 

<212> DNA 

<213> Homo sapiens 

<400> 304 

gagtaatgct 10 

<210> 305 

<211> 10 

<212> DNA 

<213> Homo ' sapiens 

<400> 305 

acacacagac 10 

<210> 306 

<211> 10 

<212> DNA 

<213> Homo sapiens 

<400> 306 

gaaacttgcc 10 

<210> 307 

<211> 10 

<212> DNA 

<213> Homo sapiens 
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<400> 307 

caaacccaga 10 



<210> 308 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 308 

ccacagtcat cccgacacta gc 22 



<210> 309 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 309 

tattccagcc gtatccatgt gc 22 



<210> 310 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 310 

ccttggtgca tgtggtaaga gg 22 



<210> 311 
<211> 23 
<212> DNA 

<213> Homo sapiens 
<400> 311 

tttcaaaggt gggaggactg agg 23 



<210> 312 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 312- 

gcagtgagct gggattgtgc 20 
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<210> 313. 

<211> 23 

'<212> DNA 

<213> Homo sapiens 

<400> 313 

aactcccctt ctctgatgtg agg 23 



<210> 314 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 314 

tcacagttac agaggtggca age 23 



<210> 315 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 315 

ctgcctacct ggcagataca cc 22 



<210> 316 . 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 316 

agctgtcact ccacctcctt gg 22 



<210> 317 

<211> 24 

<212> DNA 

<213> Homo sapiens 



<400> 317 

aaagcctctg gtctgctaat gacc 24 
<210> 318 
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<211> 22 



<212> DNA 
<213> Homo 



sapiens 



<400> 318 



gggaggagat 



tcagagcact cc 



22 



<210> 319 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 319 

cagtccacgt ttccagaaca cc 22 

<210> 320 • 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<4,00> 320 

ggcttgggat aatggtgttg c 21 

<210> 321 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 321 

tacttcccga aggtggaaga agg 23 

<210> 322 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 322 

cagtggagat cagcaagaca gtcc 24 

<210> 323 

<211> 22 

<212> DNA 

<213> Homo sapiens 
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<400> 323 

gggcatctcg ggttctactt cc 22 



<210> 324 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 324 

gggaagtacg agtgctcaca tgc 23 



<210> 325 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 325 

cttatacccc tcttccccac tgc 23 



<210> 326 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 326 

tctctgagcc aaccactgtg c 21 



<210> 327 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 327 

ggctgagtag acaatgccac tgc 23 



<210> 328 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 328 

ctgtgtcccc agagaaatgt gg 22 
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<210> 329 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 329 

gactcagcaa caagaggaca tgc 23 



<210> 330 

<211> 24 . 

<212> DNA 

<213> Homo sapiens 

<400> 330 

ggcattgtct actcagccct tacc 24 



<210> 331 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 331 

acaagtc^ag gtgcccaagg 20 



<210> 332 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 332 

cccacataca tgagggtctc ttagg 25 



<210> 333 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 333 

attctgcctc cagcatcaac c . 21 



<210> 334 
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<211> 24 



<212> DNA 
<213> Homo 



sapiens 



<400> 334 



aacagagctt 



ccttaggttg atgc 



24 



<210> 335 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 335 

cctcagttcc ccactacctt age 23 

<210> 336 

<211> 18 

<212> DNA 

<213> Homo sapiens 

<400> 336 

gcgctggccc tcaacttt 18 

<210> 337 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 337 

gtccctggag atgggacctc 20 

<210> 338 

<211> 20 

<212>-* DNA 

<213> Homo sapiens 

<400> 338 

gcccccagat ctgtcctcac 20 

<210> 339 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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<400> 339 

ggaaaataca ggcggcttcc 20 



<210> 340 

<211> 22 

<212> DMA 

<213> Homo sapiens 

<400> 340 

ggctctgaat ctgtgtggtg ct 22 



<210> 341 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 341 

agccaggtga gaagccaggt . 2f0 



<210> 342 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 342 

ggcctgaaca ggacgaacaa 20 



<210> 343 . 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 343 

ggcaggattg ccattagagg . 20 



<210> 344 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 344 

tgagtcagtg gtttgacctc ca 22 
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<210> 345 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 345 

gcctctgtct cccctgcaac 20 



<210> 346* 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 346 

ccacttttgc catcgaccac 20 



<210> 347 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 347 

ctgccgtccc ttgaaggcta 20 



<210> 348 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 348 

ccctaccctc agggatttct ca .22 



<210> 34 9 • 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 349 

cccattctcc tctccgagca " 20 



<210> 350 
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<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 350 

atcagcgtgg tgcgatgtgt 

<210> 351 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 351 

acccagctct ctgggacacg 

<210> 352 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 352 

gggatgagtt cccaagtgca g 

<210> 353 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 353 

tggcaagcag gcttgagaag 

<210> 354 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 354 

ccccaacctg agccagaaac 

<210> 355 

<211> 20 

<212> DNA 

<213> Homo sapiens 
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20 



21 



20 
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<400> 355 

tgtccacaag ggggtctgtg 20 



<210> 356 
<211> 20 

<212> DNA 

<213> Homo sapiens 
<400> 356 

ggctagcagt. ggggaagagg 20 



<210> 357 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 357 

attgccaggg gcaggatg 18 



<210> 358 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 358 . * 

ccctgtcatg gccagtcctt 20 



<210> 359 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 359 

gcgacccagt gccctctact* 20 



<210> 360 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 360 

tgcatgtcct cttgttgctg ag 22 
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<210> 361 
<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 361 

caatgaccac cctccctgaa 20 



<210> 362 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 362 

cggctgtcaa ggggtgttc 19 



<210> 363 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 363 

ccaaacccag acggcaagtt 20 



<210> 364 

.<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 364 

aaacccctgg actccaagtg atcc 24 



<210> 365 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 365 

aagcgattct tctgcctcag cctc 24 



<210> 366 
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<211> 24 



<212> DNA 



<213> Homo 



sapiens 



<400> 366 



ggacagttgt 



tgtgtagctc accc 



24 



<210> 367 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 367 

ctatgttgcc caagctgacc tc 22 

<210> 368 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 368 

gccctcaact ttgcctgcac 20 

<210> 369 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 369 

agtccagtat tccagccgta tec 23 

<210> 370 

<211> 22 
<212>"DNA 

<213> Homo sapiens 
<4D0> 370 

tgatcgggaa gctggaagag tc - 22 ' 

<210> 371 

<211> 22 

<212> DNA 

<213> Homo sapiens 
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<400> 371 

cgtttcaaag gtgggaggac tg 22 

<210> 372 
<211> 21 , 

<212> DNA 

<213> Homo sapiens 
<400> 372 

cgaccaaaaa tctgggtggt g 21 



<210> 373 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 373 

caggaagcaa agggacttgc c 21 



<210> 374 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 374 

tcttaaacat: .ggtggggtca gc 22 



<210> 375 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 375 . 

catggaaatt gtgggcttgt g 21 



<210> 376 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 376 

atgtgcaaga gggagagtgg tg 22 
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<210> 377 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 377 

tgactgagag gactgcaaag gg 22 



<210> 378 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 378 

gcctgatctc tgatgccaaa taag 24 



<210> 379 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 379 

tttgccattc cagaagccag 20 



<210> 380 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 380 

gatctgtgtg atgtcgaggc ttg 23 



<210> 381 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 381 

gtccacgttt ccagaacacc c 21 



<210> 382 
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<211> 22 



<212> DNA 



<213> Homo 



sapiens 



<400> 382 



cgaaatccca 



aagacacaga cc 



22 



<210> 383 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 383 

gagttgctga agctgcggta ag 22 

<210> 384 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 384 

gaaaaaggga gcttctgtgc ate 23 

<210> 385 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 385 

aacaagggga cagggactgg 20 

<210> 386 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 386 

cagaaacctg ggagcagatc c '21 

<210> 387 

<211> 24 

<212> DNA 

<213> Homo sapiens 
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<400> 387 

gcaacaagag gacatgcacc taag 24 



<210> 388 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400>. 388 

aaaggtagag gacatgccaa age 23 



<210> 389 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 389 

ggagcagcca acaactcgtt c 21 



<210> 390 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 390 

gctcatttaa tccccacaac acc * 23 



<210> 391 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 391 

ccaccacacc tggctaattt ttg 23 



<210> 392 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 392 

tgtagctcac cctctggact ttg 23 
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<210> 393 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 393 

aatatgcaac cctcccctgc 

<210> 394 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 394 

tgaggcagaa gaatcgcttg aac 

<210> 395 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 395 

acttgtcatt ggctgtcccc tc 

<210> 396 

<211> 2*1 

<212> DNA 

<213> Homo sapiens 

<400> 396 

tttgcctgca ctgtgctttt g- 

<210> 397 , 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 397 

ccatactcag catcctgcac tec 
<210> 398 
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20 



23 



22 



21 
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<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 398 

aacgacagca accagggtgg 20 



<210> 399 
<211> 23 . 
<212> DNA 
<213> Homo sapiens 

<400> 399 

cagcaggtgt atctaatggc agg 23 



<210> 400 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 400 

agagtggtgg ggagatgagg tg 22 



<210> 401 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 401 

tgcaaagggg cagactagag g 21 



<210> 402 
<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 402 . 

cgaccacttt tatgggagga gc - 22 



<210> 403 

<211> 23 

<212> DNA 

<213> Homo sapiens 
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<400> 403 

ccaggtgttc tgaaccacac ttc 23 



<210> 404 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 404 

tgtgtgatgt cgaggcttgt acc 23 



<210> 405 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 405 

gaatgcaggg aagagaaggc ag 22 



<210> 406 

<211> 22 

<212> -DNA 

<213> Homo sapiens 

<400> 406 

gccatcagga catggtgatt tc 22 



<210> 407 

<211> 22 

<2X2> DNA 

<213> Homo sapiens 

<400> 407 

tgagcactcg tacttcccga ag 22 



<210> 408 

<211> 22 ■ 

<212> DNA 

<213> Homo sapiens 

<400> 408 

tgagagcagc agggatgact tc 22 
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<210> 409 

<211> 23 

<212> DMA 

<213> Homo sapiens 

<400> 409 

aaactcctga tagccactgg tgg 

<210> 410 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 410 

agatcctccg ccgaaatgtc 

<210> 411 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 411 

ttactcttct ctgagatgcc cgag 

<210> 412 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 412 

tgggcagtgg cattgtctac tc 

<210> 413 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 413 

ttccaggagg tggcatttcc 
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